07-Xo~q\ \ 
JG§5 Fte'd PCT/PTO 1 8 JUL 200f 

09/88969? 

T RANSMITTAL LETTER FOR A PCT INTERNATIONAL APPLICATION 
ENTERING THE NATIONAL STAGE IN THE U.S. AS A DESIGNATED or 
ELECTED OFFICE UNDER 35 IJSC 371 

Attorney's Docket No.: SCHO0057 
Date: July 18, 2001 

Express Mail" mailing label number (from mail label): EL816159012US 

I hereby certify that this paper is being deposited with the United States Postal Service "Express Mail Post Office to 
Addressee" service, as required under 37 CFR 1.10, on the date indicated above and is addressed to the Commissioner of 
Patents and Trademarks, Washington, D.C. 20231. 

Typed Name: Jessica Pallach 




Signature: 

Commissioner of Patents 
and Trademarks 
Box Patent Application 
Attention: DO/EO 
Washington, D.C. 20231 

Transmitted herewith are the papers required to enter the national state in the U.S. as a designated office/elected 
office for the following PCT international patent application: 

INTERNATIONAL APPLICATION NUMBER: PCT/EP99/09979 
Int'I Filing Date: 15 Decemberl999 
1st Priority Date: 21 January 1999 

Inventory): Thomas SPORER, Roland BITTO, Karlheinz BRANDENBURG 
For: SYSTEM AND METHOD FOR EVALUATING THE QUALITY OF MULTI- 
CHANNEL AUDIO SIGNALS 

The United States Patent Office is: (select one) 

A Designated Office (No Demand was filed - See 37 CFR 1.494) 

_X_ An Elected Office (A Demand for Preliminary Examination was Filed - See 37 CFR 1.495) 
Enclosed are: 

_JL_ A copy of the international application (if this line is not checked, the international application was 
previously communicated by the International Bureau or the international application was originally filed in 
the USPTO). 

_2L An English Translation of the International Application 

_X_ An UNEXECUTED Combined Declaration and Power of Attorney 

A translation of amendments under Article 34 PCT 

— A copy of the Notification of the Recording of a Change 

— _ A translation of annexes to the international preliminary examination report 

An Assignment of the Invention to xxxxxxxxxxx (with $40.00 recordal fee) 

_X_ Information Disclosure Statement, 1449 Form and cited references 
_X_ A Preliminary Amendment — annotated copy of PCT/EP99/09979 
_X_ Clean copy of application after annotations filed as Preliminary Amendment 
_JL A copy of the International Search Report and cited references 
_X_ A copy of the References cited in the German Examination Report 
_JL_ A copy of the Preliminary Examination Report 

Revised Drawing Sheet as Amended under Article 19 PCT 

. Notification of the Recording of a Change 

X 3 Sheets of Formal Drawings 



09/889697 

JC17Bec'dPCT/PT0 18 JUL 2001 



FEE CALCULATION 



X- BASIC FEE $ 86 o.oo 

(IPEA-U.S. $670/335; ISA-U.S. $760/380; PTO not ISA or IPEA $970/485; 

U.S. IPEA all claims meet 33(1 )-(4) $96/48; File w/ EPO or JPO search report 860/430;) 

Surcharge for filing a late oath or declaration ($130/65) $ *** 

Surcharge for filing a late translation ($130) $ *** 

. Multiple dependent claims ($260/130) x $ 260 claim = $ *** 

Excess claims - see calculation below 

Total Claims: -21-20= 1 X $18/09claim = $ 18.00 

Independent Claims: - 2 - 3 = 0 X $78/39/ind. claim = $ - 



Excess Claim Total $ 
Assignment recordal fee ($40) $ 

TOTAL FEES $ 878.00 

X The Commissioner is hereby authorized to charge the filing fee of $878.00 and any additional fees or 
credit any overpayment to Deposit Account No. 07-1445 (Order No. SCHO0057). A duplicate copy of this 
transmittal is enclosed. 

**Please direct all correspondence concerning this case to the undersigned at GLENN PATENT GROUP, 
3475 Edison Way, Ste. L, Menlo Park, CA 94025. 



Respectfully submitted. 




Michael A. Glenn 
Registration No. 30,176 



Customer No. 22862 



09/889697 

JG17 Recti PCT/PTO 18 Jl 

National Phase of PCT/EP99/0997 9 in U.S.A. 
Title: System and method for evaluating the quality of 

multi-channel audio signals 
Applicants: SPORER, Thomas; BITTO, Roland; BRANDENBURG 
Karlheinz 




Final version of PCT/EP99/09979 for the prosecution at the 
USPTO to be filed as 



2 IVm 09/889697 

y I ' JC17fiec'dPCTiPT0 .18 JUL 200V 

System and method for evaluating the quality of multi-channel audio signals 



Field of the Invention 

The present invention relates to quality evaluation and in particular to a System and method 
for evaluating the quality of multi-channel audio signals. 

Background of the Invention and Prior Art 

ySince listening-adapted digital coding methods have been standardized, they have been 
[ftsed to an increasing extent. Examples for such cases of use are the digital compact cas- 
sette, the minidisk, digital terrestrial radio broadcasting and the digital video disk. When 
]<§oding is effected by means of listening-adapted coding methods, artificial products or arti- 
facts may, however, occur, which did not occur in analog audio signal processing. 

^forjudging or evaluating a specific encoder, listening test with test persons were carried 
ygut in the past. Although the average result provided by such listening tests is compara- 
tively reliable, there is still a subjective component. Furthermore, listening tests executed 
with a certain number of test persons are comparatively complicated and therefore com- 
paratively expensive. Hence, measurement methods have been developed for a listening- 
adapted evaluation of audio signals. 

Such a measurement method is described e.g. in DE 196 47 399 C1 . The method of listen- 
ing-adapted quality evaluation described in this publication models all non-linear hearing 
effects onto a reference signal as well as onto a test signal. The listening-adapted quality 
evaluation is carried out by means of a comparison in the cochlear domain. In so doing, the 
excitations caused in the ear by the test signal and by the reference signal are compared. 
For this purpose, both the audio reference signal and the audio test signal are divided into 
their spectral components by a filter bank. By means of a large number of filters whose fre- 
quencies overlap, a sufficient resolution with respect to time as well as frequency is guaran- 
teed. Hence, a mono audio test signal, which is derived from an audio reference signal by 
coding and subsequent decoding, can be evaluated with regard to its quality. 
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The measurement method described in DE 196 47 399 D1 also permits an evaluation of the 
quality of stereo signals, i.e. two-channel signals. For this purpose, non-linear preprocess- 
ing is carried out with the left and with the right channel of the audio test signal and of the 
audio reference signal; this preprocessing emphasizes transients in a frequency-selective 
manner and reduces stationary signals. In particular, different detections of the error prob- 
ability are carried out with the left channel of the audio reference signal and with the left 
channel of the audio test signal as input signals, with the right channel of audio reference 
signal and with the right channel of the audio test signal as input signals, with the left chan- 
nel of the preprocessed audio reference signal and with the left channel of the preproc- 
essed audio test signal as input signals and with the right channel of the preprocessed au- 
|§io reference signal and with the right channel of the preprocessed audio test signal as in- 
:gut signals so as to obtain a measure of the quality of the stereophonic audio test signal. 

% disadvantage of the known method for listening-adapted quality evaluation of audio sig- 
nals is the fact that the stereo ability is limited to a reproduction by headphones alone. In 
lather words, the audio test signal which enters the ear of a listener is compared with the 
J^iudio reference signal which enters the ear of a listener. This means that effects produced 
J|y a room, such as reflections on the walls, on the ceiling and on the floor, multiple reflec- 
tions, attenuations, etc., are not taken into account. Furthermore, known quality-evaluating 
methods are not able to take into account any directional characteristic of the human ear, 
i.e. it makes no difference whether a signal comes from the rear, from the front or from the 
side. Known measurement methods are only applicable to headphone reproduction in the 
case of which the acoustic signal is emitted by the headphone loudspeaker, which is nor- 
mally arranged directly on the ear, and is introduced in the ear or the quality-evaluating pro- 
cess. 

The known method is also disadvantageous insofar as a listening-adapted quality evalua- 
tion of multi-channel signals, such as e.g. 5-channel signals, which become more and more 
common and which are known under the headword "Dolby surround", has been absolutely 
impossible up to now. 
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Summary of the Invention 

It is the object of the present invention to provide an improved concept for evaluating the 
quality of audio signals in the case of which room effects are additionally taken into account. 

In accordance with a first aspect of the invention, this object is achieved by a system for 
evaluating the quality of an audio test signal derived from an audio reference signal by cod- 
ing and decoding, said audio test signal and said audio reference signal each comprising a 
plurality of channels, each channel being adapted to be made audible by one loudspeaker 
r©f a plurality of loudspeakers which are positioned at different positions in an at least ficti- 
cious room, and two listening reference points being defined with respect to the positions of 
ythe plurality of loudspeakers, said system comprising: a unit for converting the audio refer- 
ence signal into a first audio reference sum signal at the first reference point and into a sec- 
J&nd audio reference sum signal at the second reference point and for converting the audio 
Jest signal into a first audio test sum signal at the first reference point and into a second 
Qaudio test sum signal at the second reference point, the audio reference sum signals and 
|ihe audio test sum signals at the first and second reference points being a superposition of 
She respective channels, which can be emitted by said plurality of loudspeakers, weighted 
Nfcith a respective transfer function between the respective loudspeaker and the reference 
point in question; and a unit for evaluating the quality of the audio test sum signals while 
taking into consideration the audio reference sum signals so as to provide an indication of 
the quality of the audio test signal. 

In accordance with a second aspect of the invention, this object is achieved by a method for 
evaluating the quality of an audio test signal derived from an audio reference signal by cod- 
ing and decoding, said audio test signal and said audio reference signal each comprising a 
plurality of channels, each channel being adapted to be made audible by one loudspeaker 
of a plurality of loudspeakers which are positioned at different positions in an at least ficti- 
tious room, and two reference points being defined with respect to the positions of the plu- 
rality of loudspeakers, said method comprising the following steps: converting the audio 
reference signal into a first audio reference sum signal at the first reference point and into a 
second audio reference sum signal at the second reference point; converting the audio test 
signal into a first audio test sum signal at the first reference point and into a second audio 
test sum signal at the second reference point; weighting the respective channels, which can 
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be emitted by said plurality of loudspeakers, with a respective transfer function between the 
respective loudspeaker and the reference point in question; superimposing the weighted 
channels at said first and at said second reference point so as to obtain the audio reference 
sum signals and the audio test sum signals; and conducting the audio test sum signals and 
the audio reference sum signals to a unit for evaluating the quality of the audio test sum 
signals while taking into consideration the audio reference sum signals so as to obtain an 
indication of the quality of the audio test signal. 

The present invention is based on the finding that, although signals comprising an arbitrary 
COumber of channels exist, the human listener, who counts in the final analysis, always has 
34 n 'y two ears at his disposal. Directional hearing is caused by the different impulse re- 
sponses for different incidence directions of sound signals into the human ear. The different 
ftipulse responses for different incidence directions are referred to as "head-related transfer 
Junctions" in the field of technology. In reality, there are not only the direct sound paths be- 
tween the ear and the loudspeaker, but reflections on the walls, on the ceiling and on the 
pjpor occur as well. This can be summarized as room impulse response. The HRTFs and 
Ithe room impulse response lead, in combination, to a change of sound which can, accord- 
3|g to the present invention, also be evaluated by measurement systems without explicit 
Wiodelling of binaural effects, such as different masking thresholds for binaural signals in 
comparison with monaural signals, perception of phase shifts, precedence effects, etc.. 

When audio signals are evaluated by means of listening tests, standardized listening 
rooms, which have been standardized e.g. according to ITU-R BS.1 116, are normally used. 
The size, the loudspeaker arrangement and the reverberation time are largely determined in 
this case. When a more comprehensive quality evaluation of audio signals is carried out in 
accordance with the present invention, both the head-related transfer functions (HRTFs) as 
well as the room impulse responses can be taken into account. For the listening-adapted 
quality evaluation according to the present invention it is, furthermore, of no importance 
whether a signal is a stereo signal which is emitted by two loudspeakers for the left and for 
the right channel, or whether the signal is a multi-channel signal comprising e.g. five chan- 
nels and emitted by five loudspeakers which are positioned with respect to a listener e.g. in 
such a way that the loudspeakers are arranged at the rear left, front left, rear right, front 
right and at the front. 
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The quality-evaluating system according to the present invention comprises for this purpose 
a unit for converting the audio reference signal into a first audio reference sum signal at a 
first reference point and into a second audio reference sum signal at a second reference 
point and a unit for converting the audio test signal into a first audio test sum signal at the 
first reference point and into a second audio test sum signal at the second reference point, 
the audio reference sum signals and the audio test sum signals at the first and second ref- 
erence points being a superposition of the respective channels, which can be emitted by the 
plurality of loudspeakers, weighted with a respective transfer function between the respec- 
tive loudspeaker and the reference point in question. The audio reference sum signals and 
rtjie audio test sum signals are finally fed into a quality-evaluating unit so as to obtain an 
Indication for the quality of the audio test signal. The quality-evaluating unit can be an arbi- 
trary known unit of the type disclosed e.g. in DE 196 47 399 C1 or of the type specified in 
j|ie international standard ITU-R BS 1387 (PEAQ). 

3 T he method according to the present invention is advantageous with regard to the fact that, 
Jjiyhen the audio signal is a stereo signal, the influences of the listening room on the signal 
propagation from each loudspeaker to each reference point, i.e. each ear, can be taken into 
liccount. 

Another advantage is to be seen in the fact that the method is applicable to audio signals 
comprising an arbitrary number of channels, since the channels are converted into two sum 
signals via respective transfer functions modelling the propagation of a signal from one 
loudspeaker to one ear, in such a way that an arbitrary quality-evaluating method, which is 
suitable for two channels, can be used. 

Normally, the individual transfer functions can be gained by measurement making use of 
built-in microphones with an artificial head or of probe microphones with a human listener. 
The method according to the present invention will, however, be particularly advantageous 
when the head-related transfer functions of arbitrary persons are already known and can 
e.g. be downloaded via the Internet from a suitable server. In this case, the room impulse 
response which occurs in a listening room and which can be measured or simulated can be 
convoluted with a specific existing HRTF so as to obtain a transfer function. This will be ad- 
vantageous especially in cases where the listening room does not yet exist, i.e. where the 
acoustic properties of a room are simulated prior to actually constructing the room so as to 
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simulate the acoustic properties when e.g. concert halls or sound studios are planned and 
so as to optimize the listening room already prior to its construction. 

Brief Description of the Drawings 

In the following, preferred embodiments of the present invention will be explained in detail 
making reference to the drawings enclosed, in which: 

JFig. 1 shows a schematic block diagram of a system according to the present 

J3 invention; 

;Fig. 2 shows a schematic diagram for determining the head-related transfer 

yp functions (HRTFs); and 

Mng. 3 shows a schematic block diagram for representing the situation in a real 

listening room. 

Detailed Description of the Preferred Embodiments 

Fig. 1 shows a schematic block diagram of a system for evaluating the quality of an audio 
test signal derived from an audio reference signal by coding and decoding. The audio test 
signal and the audio reference signal each comprise a plurality of channels; each channel 
can be made audible by one loudspeaker of a plurality of loudspeakers 1 1 to 15 which are 
positioned at different positions in an at least fictitious room, two reference points 17, 18 for 
simulating the hearing being defined with respect to the positions of the plurality of loud- 
speakers 11 to 15. The quality-evaluating system includes a unit 19 for converting the audio 
reference signal into a first audio reference sum signal at the first reference point 17 and 
into a second audio reference sum signal at the second reference point 18 and for convert- 
ing the audio test signal into a first audio test sum signal at the first reference point 1 7 and 
into a second audio test sum signal at the second reference point 18, the audio reference 
sum signals and the audio test sum signals at the first and second reference points 17, 18 
being a superposition of the respective channels which can be emitted by said plurality of 
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loudspeakers 11 to 15, weighted with a respective transfer function 0F11 to UF52 between 
the respective loudspeaker 1 1 to 15 and the reference point 17, 18 in question. The quality- 
evaluating system additionally includes a unit 20 for evaluating the quality of the audio test 
sum signals while taking into consideration the audio reference sum signals so as to pro- 
vide an indication of the quality of the audio test signal at an output 21 . 

In the following, the conversion unit 19 will be explained. This unit comprises a plurality of 
transfer functions 0F11 to UF52, which are either the HRTFs, when an anechoic room, i.e. 
a room in which no reflections occur, is considered, or which are the whole transfer func- 
f t|on of the room from one of the loudspeakers 1 to 5 to a reference point 1 , 2. As can be 
|Ieen in Fig. 1, the output signals of the loudspeakers are weighted with the respective 
pfransfer functions. The output signals produced when the input signals are weighted with 
jfhe respective transfer functions are added by means of a first adder 22 so as to obtain first 
yaudio sum signals. Analogously, a second adder 23 is provided for the second reference 
Jloint 18 so as to add the output signals of the transfer functions from the respective loud- 
speakers 11 to 15 to the second reference point 18 so as to provide the second audio sum 
jlignals. It goes without saying that the audio test signal as well as the audio reference sig- 
nal are processed by means of the conversion unit 19 in such a way that the same condi- 
tions prevail for both the audio reference signal and the audio test signal in such a way that 
the unit 20 for evaluating the quality of 2-channel signals will only measure the quality of 
coding/decoding and that no other effects will disturb the measurement result. 

Although Fig. 1 shows the situation for a 5-channel audio signal, the system according to 
the present invention is also applicable to stereo signals comprising only two channels or to 
signals comprising three, four or more than five channels. In this case, it will only be neces- 
sary to add or to omit respective transfer functions. Furthermore, it should be pointed out 
that the positioning of the loudspeakers in Fig. 1 is only schematic. A correct positioning of 
the loudspeakers with respect to the reference points is shown in Fig. 2 and 3 for a 5- 
channel signal example. 

With respect to the notation of the individual transfer functions reference should be made to 
the fact that the first figure always refers to the loudspeaker, whereas the second figure re- 
fers to the reference point, i.e. reference point No. 1 (17) or reference point No. 2 (18). 
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Fig. 2 shows a possible arrangement of the five loudspeakers 1 1 to 15 with respect to a 
listener 24 whose head is schematically shown in Fig. 2 in a top view. Alternatively, the 
head 24 may be an artificial head. In any case, the head 24 comprises the first reference 
point 17 and the second reference point 18, i.e. the ears 17, 18 in the case of a human lis- 
tener or the built-in microphones 17, 18 in the case of an artificial head 24. In Fig. 2, trans- 
mission paths in the anechoic room from each of the loudspeakers 11 to 15 to each refer- 
ence point 17, 18 are depicted. The head-related transfer functions (HRTFs) are deter- 
mined by screening e.g. the head or the shoulders of the person listening and by different 
transmission times. Arrow 31a, for example, represents the transmission path from the first 
loudspeaker 1 1 to the first reference point 17. Arrow 31b, which is depicted in the form of a 
yferoken line in the area of the head 24, represents the HRTF from the first loudspeaker 1 1 to 
iJhe second reference point 18. Analogously, arrow 32a represents the transfer function from 
JShe second loudspeaker 12 to the first reference point, i.e. UF21 in Fig. 1. Arrow 32b repre- 
sents in a corresponding manner the transfer function from the second loudspeaker 12 to 
' Ihe second reference point 18, i.e. UF22 in Fig. 1. By adding the sub-signals of the loud- 
speaker output signals, which have been weighted with the respective transfer function, at 
Hhe reference points 17, 18, the first and second audio test sum signals and audio reference 
#um signals are then obtained; these audio test sum signals and audio reference sum sig- 
nals can then be fed into an arbitrary quality-evaluating unit 22 for 2-channel signals so as 
to obtain a measure of the quality of the audio test signal, which is a 5-channel signal in the 
case shown in Fig. 2. 

As has already been mentioned, the scenario in Fig. 2 shows how the head-related transfer 
functions are gained in the anechoic room. This means that, when the HRTFs are gained by 
measurement, the room must be of such a nature that no sound reflectors exist within the 
room, i.e. that the whole room must be provided with a sound absorbing lining. 

Fig. 3 shows a schematic representation of transmission paths in a listening room 30 in 
which the loudspeakers 11, 12, 13, 14, 15 are arranged in the same way as in Fig. 2. In 
addition to the direct sound, an indirect path from each loudspeaker to the left ear 17 is 
shown here. Reference should be made to the fact that the scenario in Fig. 3 does not fully 
reflect the reality, since reflections occur here on all the walls, the floor and the ceiling and 
since multiple reflections exist as well. In detail, the first loudspeaker 1 1 additionally emits 
sound which, as shown by a line 31c, is reflected on the front wall of the room 30, propa- 
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gates from the front wall and arrives at the first reference point 17. It follows that the transfer 
function from the first loudspeaker 1 1 to the left ear 17, i.e. 0F1 1 in Fig. 1 , models not only 
direct sound propagation 31a from the loudspeaker to the ear but also sound propagation 
by means of a reflection 31c from the first loudspeaker 1 1 to the first ear 17. Analogously, 
there is also an indirect path, which is indicated by an arrow 32c, from the second loud- 
speaker 12 to the first ear 17. This means that the transfer function 0F21 in Fig. 1 from the 
second loudspeaker 12 to the first reference point 17 models not only direct sound propaga- 
tion 32a but also sound propagation by means of reflection to the first ear 1 7. 

In the following, the determination of the individual transfer functions 0F11 to 0f52 (Fig. 1) 
Jjyill be explained. There are various possibilities of determining these transfer functions. 

yThe first possibility is to position the loudspeakers 11 to 15 relative to the reference points 
^7 and 18 in the manner shown in Fig. 3. Subsequently, the first loudspeaker 1 1 is excited 
Mly means of an excitation signal, whereupon the sound signal arriving at the first reference 
[point 17 is measured at this reference point; considering Fig. 3, this sound signal is a su- 
perposition of the signals 31a, 31c. In addition, the sound signal at the second reference 
ypoint 18 is measured; this sound signal could be a superposition of signal 31b and of a sig- 
nal which is not shown in Fig. 3 and which is emitted by the first loudspeaker 1 1 and re- 
flected on some wall or other in such a way that it arrives at the second reference point 18. 

The transfer function from the first loudspeaker to the first reference point 1 7 (UF1 1 in Fig. 
1) can be calculated from the excitation signal and from the sound signal measured at the 
first reference point 17. If the loudspeaker 1 1 is excited by means of an ideal pulse, the re- 
spective impulse response, which describes the transmission of the sound signal in the time 
domain, will be obtained directly at the reference points. In view of practical restrictions, this 
is, however, only a theoretical method, whereas, in practice, the loudspeaker 1 1 is excited 
by a pseudo-noise signal. This method is repeated for the additional loudspeakers 12 to 15 
in such a way that all the additional transfer functions UF21 to UF52 can be determined 
from the measured sound signal at the respective reference point and from the excitation 
signal at the respective loudspeaker. 

If, as has been stated, such measurements take place in a real space with non-absorbing 
walls, etc., the whole transfer function, which comprises the room impulse response and the 
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head-related transfer functions (HRTFs) for the individual loudspeaker positions, will be 
determined directly. If such measurements are carried out in an anechoic room, i.e. in a 
fully sound-absorbing room, the HRTFs can be determined directly in this way; these 
HRTFs are then the transfer functions 0F1 1 to 0F52. 

Irrespectively of whether the measurement is carried out by means of two built-in micro- 
phones and an artificial head or by means of two probe microphones and a test person, 
such sound measurements are complicated and expensive not least in view of the very ex- 
pensive probe microphones. 

|ff, however, head-related transfer functions (HRTFs) are known for specific persons or also 
ribr an "average person", these head-related transfer functions can be used for being convo- 
luted with the impulse response of a room; this impulse response can also be simulated. In 
ylhis case, no measurements will be necessary for determining the transfer functions 0F1 1 
Jb UF52. A substantial advantage of this method is that it can also be used for simulating 
hboms which have not yet been constructed so as to design e.g. a sound studio for an opti- 
mum sound propagation for specific loudspeaker configurations prior to actually construct- 
ing this sound studio. It follows that, in this case, it can no longer be said that the room in 
Which the quality of a coded and subsequently decoded audio test signal is to be evaluated 

actually exists. Instead, the room only exists in a simulated form and is therefore a fictitious 

room. 

Irrespectively of whether the room actually exists or whether it only exists as a fictitious 
room on the basis of a simulation, it is normally assumed that test persons are seated or 
stand in such a listening room, which may e.g. be a standardized listening room, at the best 
possible listening position. However, many test persons move their head to the front, to the 
rear, to the left or to the right while the test is taking place; this is also referred to as transla- 
tion. In addition, the persons will normally move slightly away from the optimum listening 
position, i.e. they move their heads to the left and to the right, this being also referred to as 
bearing movements or rotation. Hence, a possibly existing middle loudspeaker, i.e. the 
loudspeaker 13, will no longer be located precisely in the middle. This happens because the 
directional perception is often unsure precisely at the front. In particular, the front and the 
back are confused in many cases. This is also referred to as "front-back confusion" in the 
field of technology. Making reference to Fig. 2 and 3, it can be seen that the first reference 



11 



point 17 and the second reference point 18 change with respect to the fixed positions of the 
loudspeakers in the case of each movement of the head. 

In order to cope with this situation, the quality-evaluating method carried out by the quality- 
evaluating system shown in Fig. 1 is executed for a plurality of positions of the reference 
points 17, 18, whereupon various quality indications for the different positions will be ob- 
tained. It goes without saying that for each of the different positions of the reference points 
17, 18 different transfer functions must be ascertained and used when the method accord- 
ing to the present invention is being executed. The output obtained will then be a plurality of 
quality indications for different positions of the reference points 17, 18, i.e. for different head 
ypositions. 

yD/arious possibilities exist for evaluating the different quality indications. On the one hand, 
%n average value can be assumed so as to be able to make a general statement to the ef- 
fect that a certain coding/decoding method may perhaps be optimal, if the position of the 
iJjead is not changed at all, or that this method is less advantageous than some other coding 
■-method in the case of certain translations or bearing movements or rotations of the head. 

Nbn the other hand, the "worst case" of the individual measurements can be found out so as 
to be able to make a statement whether a certain coding/decoding method is sub-optimal in 
the case of a specific position of the head with respect to the five loudspeakers when 5- 
channel audio signals are processed. It will be advantageous to carry out such quality 
evaluations for a plurality of positions of the reference points 17, 18 close to the optimum 
reference listening position on the one hand. On the other hand, such measurements can 
also be carried out for other sites which are not located at the reference listening position so 
that e.g. certain other seats in a sound studio can be judged so as to find out whether or not 
coding/decoding errors can be heard there. 

The above description shows clearly that the system according to the present invention and 
the method according to the present invention provide existing quality-evaluating systems 
and methods with a substantial amount of flexibility in such a way that it is not only possible 
to evaluate the quality of audio signals with more than two channels but that it is also possi- 
ble to act out quality evaluations for different scenarios of positioning the reference points 
17, 18 relative to the loudspeakers 1 1 to 15, and that the system according to the present 



12 



invention and the method according to the present invention can even be used for design- 
ing sound studios or other listening rooms, such a cinemas, so as to be able to carry out a 
listening-adapted evaluation of the quality of specific coding/decoding methods in a specific 
room. Furthermore, the method according to the present invention and the system accord- 
ing to the present invention can be used for designing listening rooms so that the optimum 
coding method among a large number of possible coding methods can be selected for a 
specific room. 

The transfer functions UF1 1 - UF52 can be realized in the field of circuit technology in dif- 
ferent ways. Preferably, they are realized through an FIR filter for each impulse response. 
jfPleference should be made to the fact that for large rooms the FIR filters may have a con- 
siderable length; in the case of a sampling frequency of 48 kHz their length may e.g. ex- 
ceed 100,000 sampling values. In this case, it will be advisable to represent the first milli- 
seconds of this length, where the reflections occurring are primarily discrete reflections, 

fhore precisely than the time domain towards the end of the filter, where the reflections oc- 
curring are primarily diffuse reflections. 
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Claims 

1 . A system for evaluating the quality of an audio test signal derived from an audio refer- 
ence signal by coding and decoding, said audio test signal and said audio reference 
signal each comprising a plurality of channels, each channel being adapted to be made 
audible by one loudspeaker of a plurality of loudspeakers which are positioned at dif- 
ferent positions in an at least fictitious room, and two listening reference points being 
defined with respect to the positions of the plurality of loudspeakers, said system com- 
prising: 

S a unit for converting the audio reference signal into a first audio reference sum signal at 
ii the first reference point and into a second audio reference sum signal at the second 
D reference point and for converting the audio test signal into a first audio test sum signal 
at the first reference point and into a second audio test sum signal at the second refer- 
ence point, the audio reference sum signals and the audio test sum signals at the first 
^ and second reference points being a superposition of the respective channels, which 
3 can be emitted by said plurality of loudspeakers, weighted with a respective transfer 
g function between the respective loudspeaker and the reference point in question; and 

a unit for evaluating the quality of the audio test sum signals while taking into consid- 
eration the audio reference sum signals so as to provide an indication of the quality of 
the audio test signal. 

2. A system according to claim 1 , wherein the transfer functions between the respective 
loudspeakers and the respective reference points are individual head-related transfer 
functions so as to take into account the different impulse responses for different sound 
incidence directions into the human ear. 

3. A system according to claim 2, wherein the transfer functions between the respective 
loudspeakers and the respective reference points are mean head-related transfer func- 
tions obtained by averaging a large number of individuals. 

4. A system according to claim 1 , wherein the transfer function between the respective 
loudspeaker and the respective reference point is a transfer function which corre- 
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sponds to the convolution of the head-related transfer function with a room impulse re- 
sponse in such a way that the sound reflections of the room in which the plurality of 
loudspeakers and the two reference points are positioned are taken into account. 

5. A system according to claim 1 , wherein the transfer functions between the respective 
loudspeakers and the respective reference points are averaged transfer functions 
which are the result of averaging individual transfer functions between fixed loud- 
speaker positions and varying positions of the reference points. 

6. A system according to claim 1 , wherein said conversion unit is arranged for providing 
transfer functions for various positions of said first and second reference points with 

S3 respect to fixed loudspeaker positions, and wherein the quality-evaluating unit is ar- 
ii3 ranged for providing the indication of the quality of the audio test signal for various 
% transfer functions and for providing the positions of the reference points for the indica- 
tion of the poorest quality. 

3. A system according to claim 1 , wherein the room is a standardized reference listening 
room and wherein the two reference points simulate the auditory organs of a test per- 
p son at a reference listening position. 

8. A system according to claim 1 , wherein the room is a sound studio and wherein the two 
reference points simulate the auditory organs of a test person at an arbitrary 
seated/standing position in said room. 

9. A system according to claim 5 t wherein the different positions of the first and second 
reference points deviate only slightly from a reference position so as to simulate a bear- 
ing movement of a test person. 

1 0. A system according to claim 5 , wherein the different positions of the first and second 
reference points deviate markedly from the reference position so as to simulate a rota- 
tion of the head of a test listener. 

1 1 . A system according to claim 5 , wherein the audio test signal comprises five channels, 
said five channels being a left rear, a right rear, a left front, a right front and a middle 



15 



front channel. 

12. A system according to claim 1 , wherein the audio test signal is a stereo signal. 

13. A system according to claim 1 . wherein the conversion unit comprises: 

an FIR filter for each loudspeaker/reference-point combination, the filter coefficients of 
each FIR filter being determined by the transfer function of the transmission path from 
the respective loudspeaker to the respective reference point; 

a first adder for the first reference point for adding the output signals of the FIR filters, 
5 which represent transmission paths to the first reference point, so as to provide the first 
3 audio test sum signal and the first audio reference sum signal, respectively; and 

a second adder for the second reference point for adding the output signals of the FIR 
filters, which represent a transmission path to the second reference point, so as to pro- 
vide the second audio test sum signal and the second audio reference sum signal, re- 
y spectively. 

14. A method for evaluating the quality of an audio test signal derived from an audio refer- 
ence signal by coding and decoding, said audio test signal and said audio reference 
signal each comprising a plurality of channels, each channel being adapted to be made 
audible by one loudspeaker of a plurality of loudspeakers which are positioned at dif- 
ferent positions in an at least fictitious room, and two reference points being defined 
with respect to the positions of the plurality of loudspeakers, said method comprising 
the following steps: 

converting the audio reference signal into a first audio reference sum signal at the first 
reference point and into a second audio reference sum signal at the second reference 
point; 

converting the audio test signal into a first audio test sum signal at the first reference 
point and into a second audio test sum signal at the second reference point; 
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weighting the respective channels, which can be emitted by said plurality of loudspeak- 
ers, with a respective transfer function between the respective loudspeaker and the 
reference point in question; 

superimposing the weighted channels at said first and at said second reference point 
so as to obtain the audio reference sum signals and the audio test sum signals; and 

conducting the audio test sum signals and the audio reference sum signals to a unit for 
evaluating the quality of the audio test sum signals while taking into consideration the 
audio reference sum signals so as to obtain an indication of the quality of the audio test 
5 signal. 

€35. A method according to claim 14,wherein the following step precedes the step of con- 
verting: 

li s obtaining the individual transfer functions between each loudspeaker and each refer- 
1 ence point. 

Hi 6. A method according to claim 15, wherein the step of obtaining comprises the following 
sub-steps: 

exciting a loudspeaker with an excitation signal; 
measuring the signal at each reference point; 

determining the transfer function between the excited loudspeaker and the first refer- 
ence point; 

determining the transfer function between the excited loudspeaker and the second ref- 
erence point; and 

repeating the steps of exciting, measuring and determining until all the loudspeakers 
have been excited so as to obtain the individual transfer functions. 
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17. A method according to claim 16, wherein the first and second reference points are the 
ears of a human listener. 

18. A method according to claim 16, wherein the first and second reference points are built- 
in microphones of an artificial head. 

19. A method according to claim 16 , wherein the excitation signal is pseudo-noise signal. 

20. A method according to claim 15, wherein the step of obtaining comprises the following 
sub-steps: 

accessing a head-related transfer function for a determined positioning of a loud- 
i speaker relative to the first reference point; 

determining a room impulse response for the position of the loudspeaker in the room; 

convoluting the head-related transfer function with said room impulse response so as to 
i obtain the transfer function from said loudspeaker to the first reference point; 

repeating the steps of accessing, determining and convoluting so as to obtain the trans- 
fer function from said loudspeaker to the second reference point; and 

executing the steps of accessing, determining, folding and repeating for each additional 
loudspeaker so as to obtain all the individual transfer functions. 

21 . A method according to claim 20, wherein the room impulse response is determined by 
simulating the room. 
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System and method for evaluating the quality of multi-channel audio signals 

Abstract 

A system for evaluating the quality of an audio test signal derived from an audio reference 
signal by coding and decoding, said audio test signal and said audio reference signal each 
comprising a plurality of channels, comprises a unit for converting the audio reference sig- 
nal into a first audio reference sum signal at a first reference point and into a second audio 
yieference sum signal at a second reference point and for converting the audio test signal 
S^nto a first audio test sum signal at the first reference point and into a second audio test sum 
Signal at the second reference point, the audio reference sum signals and the audio test 
yf urn signals at the first and second reference points being a superposition of the respective 
Channels, which can be emitted by a plurality of loudspeakers, weighted with a respective 
^ transfer function between the respective loudspeaker and the reference point in question, 
hand a unit for evaluating the quality of the audio test sum signals while taking into consid- 
eration the audio reference sum signals so as to provide an indication of the quality of the 
tludio test signal. The system according to the present invention permits real rooms and an 
arbitrary number of channels of the audio test signal to be taken into account so as to exe- 
cute a listening-adapted evaluation of the quality of a specific coding/decoding method. 
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System and method for evaluating the quality of multi-channel audio signals 

[Description] 
Field of the Invention 

The present invention relates to quality evaluation and in particular to a System and method 
for evaluating the quality of multi-channel audio signals- 
Background of the Invention and Prior Art 

SSince listening-adapted digital coding methods have been standardized, they have been 
iused to an increasing extent. Examples for such cases of use are the digital compact cas- 
sette, the minidisk, digital terrestrial radio broadcasting and the digital video disk. When 
Icoding is effected by means of listening-adapted coding methods, artificial products or arti- 
facts may, however, occur, which did not occur in analog audio signal processing. 

^Forjudging or evaluating a specific encoder, listening test with test persons were carried 
3out in the past. Although the average result provided by such listening tests is compara- 
tively reliable, there is still a subjective component. Furthermore, listening tests executed 
with a certain number of test persons are comparatively complicated and therefore com- 
paratively expensive. Hence, measurement methods have been developed for a listening- 
adapted evaluation of audio signals. 

Such a measurement method is described e.g. in DE 196 47 399 C1. The method of listen- 
ing-adapted quality evaluation described in this publication models all non-linear hearing 
effects onto a reference signal as well as onto a test signal. The listening-adapted quality 
evaluation is carried out by means of a comparison in the cochlear domain. In so doing, the 
excitations caused in the ear by the test signal and by the reference signal are compared. 
For this purpose, both the audio reference signal and the audio test signal are divided into 
their spectral components by a filter bank. By means of a large number of filters whose fre- 
quencies overlap, a sufficient resolution with respect to time as well as frequency is guar- 
anteed. Hence, a mono audio test signal, which is derived from an audio reference signal 
by coding and subsequent decoding, can be evaluated with regard to its quality. 



The measurement method described in DE 196 47 399 D1 also permits an evaluation of the 
quality of stereo signals, i.e. two-channel signals. For this purpose, non-linear preprocess- 
ing is carried out with the left and with the right channel of the audio test signal and of the 
audio reference signal; this preprocessing emphasizes transients in a frequency-selective 
manner and reduces stationary signals. In particular, different detections of the error prob- 
ability are carried out with the left channel of the audio reference signal and with the left 
channel of the audio test signal as input signals, with the right channel of audio reference 
signal and with the right channel of the audio test signal as input signals, with the left chan- 
nel of the preprocessed audio reference signal and with the left channel of the preproc- 
Jessed audio test signal as input signals and with the right channel of the preprocessed 
t audio reference signal and with the right channel of the preprocessed audio test signal as 
i input signals so as to obtain a measure of the quality of the stereophonic audio test signal. 

I A disadvantage of the known method for listening-adapted quality evaluation of audio sig- 
nals is the fact that the stereo ability is limited to a reproduction by headphones alone. In 
I other words, the audio test signal which enters the ear of a listener is compared with the 
i audio reference signal which enters the ear of a listener. This means that effects produced 
3 by a room, such as reflections on the walls, on the ceiling and on the floor, multiple reflec- 
tions, attenuations, etc., are not taken into account. Furthermore, known quality-evaluating 
methods are not able to take into account any directional characteristic of the human ear, 
i.e. it makes no difference whether a signal comes from the rear, from the front or from the 
side. Known measurement methods are only applicable to headphone reproduction in the 
case of which the acoustic signal is emitted by the headphone loudspeaker, which is nor- 
mally arranged directly on the ear, and is introduced in the ear or the quality-evaluating pro- 
cess. 

The known method is also disadvantageous insofar as a listening-adapted quality evalua- 
tion of multi-channel signals, such as e.g. 5-channel signals, which become more and more 
common and which are known under the headword "Dolby surround", has been absolutely 
impossible up to now. 



Summary of the Invention 



It is the object of the present invention to provide an improved concept for evaluating the 
quality of audio signals in the case of which room effects are additionally taken into account. 

[This object is achieved by a quality-evaluating system according to claim 1 and by a qual- 
ity-evaluating method according to claim 14.] 

In accordance with a first aspect of the invention, this object is achieved by a system for 
evaluating the quality of an audio test signal derived from an audio refe rence signal by 
f coding and decoding, said audio test signal and said audio reference signal each compris- 
jing a plurality of channels, each channel being adapted to be made audible bv one loud- 
i speaker of a plurality of loudspeakers which are positioned at different positions in an at 
^ least fictitious room, and two listening reference points being defined with re spect to the 
l oositions of the plurality of loudspeakers, said system comprising: a unit for converting the 
audio reference signal into a first audio reference sum signal at the first reference point and 
j into a second audio reference sum signal at the second reference point and for converting 
I the audio test signal into a first audio test sum signal at the first reference point and into a 
3 second audio test sum signal at the second reference point, the audio refere nce sum sig- 
nals and the audio test sum signals at the first and second reference points being a super- 
position of the respective channels, which can be emitted bv said plurality of loudspeakers, 
weighted with a respective transfer function between the respective loudspeaker and the 
reference point in guestion: and a unit for evaluating the gualitv of the aud io test sum sig- 
nals while taking into consideration the audio reference sum signals so as to provide an 
indication of the gualitv of the audio test signal. 

In accordance with a second aspect of the invention, this object is achieved bv a method for 
evaluating the gualitv of an audio test signal derived from an audio reference signal by 
coding and decoding, said audio test signal and said audio reference signal each compris- 
ing a plurality of channels, each channel being adapted to be made audible bv one loud- 
speaker of a plurality of loudspeakers which are positioned at different positions in an at 
least fictitious room, and two reference points being defined with respect to the positions of 
the plurality of loudspeakers, said method comprising the following steps: converting the 
audio reference signal into a first audio reference sum signal at the first reference point and 
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into a second audio reference sum signal at the second reference point; converting the 
audio test signal into a first audio test sum signal at the first reference point and into a sec- 
ond audio test sum signal at the second reference point; weighting the respective channels, 
which can be emitted by said plurality of loudspeakers, with a respective transfer function 
between the respective loudspeaker and the reference point in guestion: superimposing the 
weighted channels at said first and at said second reference point so as to obtain the audio 
reference sum signals and the audio test sum signals; and conducting the audio test sum 
signals and the audio reference sum signals to a unit for evaluating the gualitv of the audio 
test sum signals while taking into consideration the audio reference sum signals so as to 
obtain an indication of the gualitv of the audio test signal. 

The present invention is based on the finding that, although signals comprising an arbitrary 
Cmumber of channels exist, the human listener, who counts in the final analysis, always has 
J;;/ only two ears at his disposal. Directional hearing is caused by the different impulse re- 
sponses for different incidence directions of sound signals into the human ear. The different 
L impulse responses for different incidence directions are referred to as "head-related transfer 
^functions" in the field of technology. In reality, there are not only the direct sound paths be- 
yp tween the ear and the loudspeaker, but reflections on the walls, on the ceiling and on the 
y floor occur as well. This can be summarized as room impulse response. The HRTFs and 
the room impulse response lead, in combination, to a change of sound which can, accord- 
ing to the present invention, also be evaluated by measurement systems without explicit 
modelling of binaural effects, such as different masking thresholds for binaural signals in 
comparison with monaural signals, perception of phase shifts, precedence effects, etc.. 

When audio signals are evaluated by means of listening tests, standardized listening 
rooms, which have been standardized e.g. according to ITU-R BS.1116, are normally used. 
The size, the loudspeaker arrangement and the reverberation time are largely determined in 
this case. When a more comprehensive quality evaluation of audio signals is carried out in 
accordance with the present invention, both the head-related transfer functions (HRTFs) as 
well as the room impulse responses can be taken into account. For the listening-adapted 
quality evaluation according to the present invention it is, furthermore, of no importance 
whether a signal is a stereo signal which is emitted by two loudspeakers for the left and for 
the right channel, or whether the signal is a multi-channel signal comprising e.g. five chan- 
nels and emitted by five loudspeakers which are positioned with respect to a listener e.g. in 



such a way that the loudspeakers are arranged at the rear left, front left, rear right, front 
right and at the front. 

The quality-evaluating system according to the present invention comprises for this purpose 
a unit for converting the audio reference signal into a first audio reference sum signal at a 
first reference point and into a second audio reference sum signal at a second reference 
point and a unit for converting the audio test signal into a first audio test sum signal at the 
first reference point and into a second audio test sum signal at the second reference point, 
the audio reference sum signals and the audio test sum signals at the first and second ref- 
erence points being a superposition of the respective channels, which can be emitted by the 
plurality of loudspeakers, weighted with a respective transfer function between the respec- 
tive loudspeaker and the reference point in question. The audio reference sum signals and 
the audio test sum signals are finally fed into a quality-evaluating unit so as to obtain an 
* indication for the quality of the audio test signal. The quality-evaluating unit can be an arbi- 
i trary known unit of the type disclosed e.g. in DE 196 47 399 C1 or of the type specified in 
the international standard ITU-R BS 1387 (PEAQ). 

The method according to the present invention is advantageous with regard to the fact that, 
when the audio signal is a stereo signal, the influences of the listening room on the signal 
propagation from each loudspeaker to each reference point, i.e. each ear, can be taken into 
account. 

Another advantage is to be seen in the fact that the method is applicable to audio signals 
comprising an arbitrary number of channels, since the channels are converted into two sum 
signals via respective transfer functions modelling the propagation of a signal from one 
loudspeaker to one ear, in such a way that an arbitrary quality-evaluating method, which is 
suitable for two channels, can be used. 

Normally, the individual transfer functions can be gained by measurement making use of 
built-in microphones with an artificial head or of probe microphones with a human listener. 
The method according to the present invention will, however, be particularly advantageous 
when the head-related transfer functions of arbitrary persons are already known and can 
e.g. be downloaded via the Internet from a suitable server. In this case, the room impulse 
response which occurs in a listening room and which can be measured or simulated can be 



convoluted with a specific existing HRTF so as to obtain a transfer function. This will be ad- 
vantageous especially in cases where the listening room does not yet exist, i.e. where the 
acoustic properties of a room are simulated prior to actually constructing the room so as to 
simulate the acoustic properties when e.g. concert halls or sound studios are planned and 
so as to optimize the listening room already prior to its construction. 

Brief Description of the Drawings 

In the following, preferred embodiments of the present invention will be explained in detail 
I making reference to the drawings enclosed, in which: 

{ Fig. 1 shows a schematic block diagram of a system according to the present 

invention; 

Fig. 2 shows a schematic diagram for determining the head-related transfer 

functions (HRTFs); and 

I Fig. 3 shows a schematic block diagram for representing the situation in a real 

listening room. 

Detailed Description of the Preferred Embodiments 

Fig. 1 shows a schematic block diagram of a system for evaluating the quality of an audio 
test signal derived from an audio reference signal by coding and decoding. The audio test 
signal and the audio reference signal each comprise a plurality of channels; each channel 
can be made audible by one loudspeaker of a plurality of loudspeakers 11 to 15 which are 
positioned at different positions in an at least fictitious room, two reference points 17, 18 for 
simulating the hearing being defined with respect to the positions of the plurality of loud- 
speakers 11 to 15. The quality-evaluating system includes a unit 19 for converting the audio 
reference signal into a first audio reference sum signal at the first reference point 17 and 
into a second audio reference sum signal at the second reference point 18 and for convert- 
ing the audio test signal into a first audio test sum signal at the first reference point 17 and 
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into a second audio test sum signal at the second reference point 18, the audio reference 
sum signals and the audio test sum signals at the first and second reference points 17, 18 
being a superposition of the respective channels which can be emitted by said plurality of 
loudspeakers 11 to 15, weighted with a respective transfer function UF11 to UF52 between 
the respective loudspeaker 11 to 15 and the reference point 17, 18 in question. The quality- 
evaluating system additionally includes a unit 20 for evaluating the quality of the audio test 
sum signals while taking into consideration the audio reference sum signals so as to pro- 
vide an indication of the quality of the audio test signal at an output 21 . 

In the following, the conversion unit 19 will be explained. This unit comprises a plurality of 
^transfer functions UF11 to UF52, which are either the HRTFs, when an anechoic room, i.e. 
I a room in which no reflections occur, is considered, or which are the whole transfer func- 
tion of the room from one of the loudspeakers 1 to 5 to a reference point 1 , 2. As can be 
iseen in Fig. 1, the output signals of the loudspeakers are weighted with the respective 
jtransfer functions. The output signals produced when the input signals are weighted with 
the respective transfer functions are added by means of a first adder 22 so as to obtain first 
jaudio sum signals. Analogously, a second adder 23 is provided for the second reference 
!; point 18 so as to add the output signals of the transfer functions from the respective loud- 
ispeakers 1 1 to 15 to the second reference point 18 so as to provide the second audio sum 
" signals. It goes without saying that the audio test signal as well as the audio reference sig- 
nal are processed by means of the conversion unit 19 in such a way that the same condi- 
tions prevail for both the audio reference signal and the audio test signal in such a way that 
the unit 20 for evaluating the quality of 2-channel signals will only measure the quality of 
coding/decoding and that no other effects will disturb the measurement result. 

Although Fig. 1 shows the situation for a 5-channel audio signal, the system according to 
the present invention is also applicable to stereo signals comprising only two channels or to 
signals comprising three, four or more than five channels. In this case, it will only be neces- 
sary to add or to omit respective transfer functions. Furthermore, it should be pointed out 
that the positioning of the loudspeakers in Fig. 1 is only schematic. A correct positioning of 
the loudspeakers with respect to the reference points is shown in Fig. 2 and 3 for a 5- 
channel signal example. 



With respect to the notation of the individual transfer functions reference should be made to 
the fact that the first figure always refers to the loudspeaker, whereas the second figure re- 
fers to the reference point, i.e. reference point No. 1 (17) or reference point No. 2 (18). 

Fig. 2 shows a possible arrangement of the five loudspeakers 1 1 to 15 with respect to a 
listener 24 whose head is schematically shown in Fig. 2 in a top view. Alternatively, the 
head 24 may be an artificial head. In any case, the head 24 comprises the first reference 
point 17 and the second reference point 18, i.e. the ears 17, 18 in the case of a human lis- 
tener or the built-in microphones 17, 18 in the case of an artificial head 24. In Fig. 2, trans- 
mission paths in the anechoic room from each of the loudspeakers 11 to 15 to each refer- 
ence point 17, 18 are depicted. The head-related transfer functions (HRTFs) are deter- 
mined by screening e.g. the head or the shoulders of the person listening and by different 
; transmission times. Arrow 31a, for example, represents the transmission path from the first 
i loudspeaker 1 1 to the first reference point 1 7. Arrow 31 b, which is depicted in the form of a 
{broken line in the area of the head 24, represents the HRTF from the first loudspeaker 1 1 to 
the second reference point 18. Analogously, arrow 32a represents the transfer function from 
jthe second loudspeaker 12 to the first reference point, i.e. UF21 in Fig. 1. Arrow 32b repre- 
sents in a corresponding manner the transfer function from the second loudspeaker 12 to 
ithe second reference point 18, i.e. 0F22 in Fig. 1. By adding the sub-signals of the loud- 
speaker output signals, which have been weighted with the respective transfer function, at 
the reference points 17, 18, the first and second audio test sum signals and audio reference 
sum signals are then obtained; these audio test sum signals and audio reference sum sig- 
nals can then be fed into an arbitrary quality-evaluating unit 22 for 2-channel signals so as 
to obtain a measure of the quality of the audio test signal, which is a 5-channel signal in the 
case shown in Fig. 2. 

As has already been mentioned, the scenario in Fig. 2 shows how the head-related transfer 
functions are gained in the anechoic room. This means that, when the HRTFs are gained by 
measurement, the room must be of such a nature that no sound reflectors exist within the 
room, i.e. that the whole room must be provided with a sound absorbing lining. 

Fig. 3 shows a schematic representation of transmission paths in a listening room 30 in 
which the loudspeakers 11, 12, 13, 14, 15 are arranged in the same way as in Fig. 2. In 
addition to the direct sound, an indirect path from each loudspeaker to the left ear 17 is 
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shown here. Reference should be made to the fact that the scenario in Fig. 3 does not fully 
reflect the reality, since reflections occur here on all the walls, the floor and the ceiling and 
since multiple reflections exist as well. In detail, the first loudspeaker 11 additionally emits 
sound which, as shown by a line 31c, is reflected on the front wall of the room 30, propa- 
gates from the front wall and arrives at the first reference point 17. It follows that the transfer 
function from the first loudspeaker 1 1 to the left ear 1 7, i.e. UF1 1 in Fig. 1 , models not only 
direct sound propagation 31a from the loudspeaker to the ear but also sound propagation 
by means of a reflection 31c from the first loudspeaker 1 1 to the first ear 17. Analogously, 
there is also an indirect path, which is indicated by an arrow 32c, from the second loud- 
speaker 12 to the first ear 17. This means that the transfer function UF21 in Fig. 1 from the 
second loudspeaker 12 to the first reference point 17 models not only direct sound propa- 
gation 32a but also sound propagation by means of reflection to the first ear 17. 

In the following, the determination of the individual transfer functions 0F11 to Uf52 (Fig. 1) 
will be explained. There are various possibilities of determining these transfer functions. 

The first possibility is to position the loudspeakers 11 to 15 relative to the reference points 
17 and 18 in the manner shown in Fig. 3. Subsequently, the first loudspeaker 1 1 is excited 
by means of an excitation signal, whereupon the sound signal arriving at the first reference 
point 17 is measured at this reference point; considering Fig. 3, this sound signal is a su- 
perposition of the signals 31a, 31c. In addition, the sound signal at the second reference 
point 18 is measured; this sound signal could be a superposition of signal 31b and of a sig- 
nal which is not shown in Fig. 3 and which is emitted by the first loudspeaker 1 1 and re- 
flected on some wall or other in such a way that it arrives at the second reference point 1 8. 

The transfer function from the first loudspeaker to the first reference point 17 (UF1 1 in Fig. 
1) can be calculated from the excitation signal and from the sound signal measured at the 
first reference point 17. If the loudspeaker 11 is excited by means of an ideal pulse, the re- 
spective impulse response, which describes the transmission of the sound signal in the time 
domain, will be obtained directly at the reference points. In view of practical restrictions, this 
is, however, only a theoretical method, whereas, in practice, the loudspeaker 11 is excited 
by a pseudo-noise signal. This method is repeated for the additional loudspeakers 12 to 15 
in such a way that all the additional transfer functions UF21 to UF52 can be determined 
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from the measured sound signal at the respective reference point and from the excitation 
signal at the respective loudspeaker. 

If, as has been stated, such measurements take place in a real space with non-absorbing 
walls, etc., the whole transfer function, which comprises the room impulse response and the 
head-related transfer functions (HRTFs) for the individual loudspeaker positions, will be 
determined directly. If such measurements are carried out in an anechoic room, i.e. in a 
fully sound-absorbing room, the HRTFs can be determined directly in this way; these 
HRTFs are then the transfer functions 0F1 1 to GF52. 

Irrespectively of whether the measurement is carried out by means of two built-in micro- 
phones and an artificial head or by means of two probe microphones and a test person, 
such sound measurements are complicated and expensive not least in view of the very ex- 
pensive probe microphones. 

If, however, head-related transfer functions (HRTFs) are known for specific persons or also 
for an "average person", these head-related transfer functions can be used for being con- 
voluted with the impulse response of a room; this impulse response can also be simulated. 
In this case, no measurements will be necessary for determining the transfer functions 
0F1 1 to 0F52. A substantial advantage of this method is that it can also be used for simu- 
lating rooms which have not yet been constructed so as to design e.g. a sound studio for an 
optimum sound propagation for specific loudspeaker configurations prior to actually con- 
structing this sound studio. It follows that, in this case, it can no longer be said that the room 
in which the quality of a coded and subsequently decoded audio test signal is to be evalu- 
ated actually exists. Instead, the room only exists in a simulated form and is therefore a fic- 
titious room. 

Irrespectively of whether the room actually exists or whether it only exists as a fictitious 
room on the basis of a simulation, it is normally assumed that test persons are seated or 
stand in such a listening room, which may e.g. be a standardized listening room, at the best 
possible listening position. However, many test persons move their head to the front, to the 
rear, to the left or to the right while the test is taking place; this is also referred to as transla- 
tion. In addition, the persons will normally move slightly away from the optimum listening 
position, i.e. they move their heads to the left and to the right, this being also referred to as 



bearing movements or rotation. Hence, a possibly existing middle loudspeaker, i.e. the 
loudspeaker 13, will no longer be located precisely in the middle. This happens because the 
directional perception is often unsure precisely at the front. In particular, the front and the 
back are confused in many cases. This is also referred to as "front-back confusion" in the 
field of technology. Making reference to Fig. 2 and 3, it can be seen that the first reference 
point 17 and the second reference point 18 change with respect to the fixed positions of the 
loudspeakers in the case of each movement of the head. 

In order to cope with this situation, the quality-evaluating method carried out by the quality- 
evaluating system shown in Fig. 1 is executed for a plurality of positions of the reference 
points 17, 18, whereupon various quality indications for the different positions will be ob- 
tained. It goes without saying that for each of the different positions of the reference points 
17, 18 different transfer functions must be ascertained and used when the method accord- 
ing to the present invention is being executed. The output obtained will then be a plurality of 
quality indications for different positions of the reference points 17, 18, i.e. for different head 
positions. 

Various possibilities exist for evaluating the different quality indications. On the one hand, 
an average value can be assumed so as to be able to make a general statement to the ef- 
fect that a certain coding/decoding method may perhaps be optimal, if the position of the 
head is not changed at all, or that this method is less advantageous than some other coding 
method in the case of certain translations or bearing movements or rotations of the head. 

On the other hand, the "worst case" of the individual measurements can be found out so as 
to be able to make a statement whether a certain coding/decoding method is sub-optimal in 
the case of a specific position of the head with respect to the five loudspeakers when 5- 
channel audio signals are processed. It will be advantageous to carry out such quality 
evaluations for a plurality of positions of the reference points 17, 18 close to the optimum 
reference listening position on the one hand. On the other hand, such measurements can 
also be carried out for other sites which are not located at the reference listening position so 
that e.g. certain other seats in a sound studio can be judged so as to find out whether or not 
coding/decoding errors can be heard there. 
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The above description shows clearly that the system according to the present invention and 
the method according to the present invention provide existing quality-evaluating systems 
and methods with a substantial amount of flexibility in such a way that it is not only possible 
to evaluate the quality of audio signals with more than two channels but that it is also possi- 
ble to act out quality evaluations for different scenarios of positioning the reference points 
17, 18 relative to the loudspeakers 1 1 to 15, and that the system according to the present 
invention and the method according to the present invention can even be used for design- 
ing sound studios or other listening rooms, such a cinemas, so as to be able to carry out a 
listening-adapted evaluation of the quality of specific coding/decoding methods in a specific 
room. Furthermore, the method according to the present invention and the system accord- 
ing to the present invention can be used for designing listening rooms so that the optimum 
coding method among a large number of possible coding methods can be selected for a 
specific room. 

The transfer functions 0F1 1 - 0F52 can be realized in the field of circuit technology in dif- 
ferent ways. Preferably, they are realized through an FIR filter for each impulse response. 
Reference should be made to the fact that for large rooms the FIR filters may have a con- 
siderable length; in the case of a sampling frequency of 48 kHz their length may e.g. ex- 
ceed 100,000 sampling values. In this case, it will be advisable to represent the first milli- 
seconds of this length, where the reflections occurring are primarily discrete reflections, 
more precisely than the time domain towards the end of the filter, where the reflections oc- 
curring are primarily diffuse reflections. 
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Claims 



1 . A system for evaluating the quality of an audio test signal derived from an audio refer- 
ence signal by coding and decoding, said audio test signal and said audio reference 
signal each comprising a plurality of channels, each channel being adapted to be made 
audible by one loudspeaker [(1 1 - 15)] of a plurality of loudspeakers which are posi- 
tioned at different positions in an at least fictitious room [(30)], and two listening refer- 
ence points [(17, 18)] being defined with respect to the positions of the plurality of loud- 
speakers, said system comprising: 

a unit [(19)] for converting the audio reference signal into a first audio reference sum 
signal at the first reference point [(17)] and into a second audio reference sum signal at 
the second reference point [(18)] and for converting the audio test signal into a first 
audio test sum signal at the first reference point [(17)] and into a second audio test sum 
signal at the second reference point [(18)], the audio reference sum signals and the 
audio test sum signals at the first and second reference points [(1 7, 1 8)] being a super- 
position of the respective channels, which can be emitted by said plurality of loud- 
speakers [(11 - 15)], weighted with a respective transfer function [(0F11 - 0F52)] be- 
tween the respective loudspeaker and the reference point in question; and 

a unit [(20)] for evaluating the quality of the audio test sum signals while taking into 
consideration the audio reference sum signals so as to provide an indication of the 
quality of the audio test signal. 

2. A system according to claim 1 , wherein the transfer functions [(CIF1 1 - QF52)] between 
the respective loudspeakers [(11 - 15)] and the respective reference points [(17, 18)] 
are individual head-related transfer functions [(HRTFs)] so as to take into account the 
different impulse responses for different sound incidence directions into the human ear 
[(17,18)]. 

3. A system according to claim 2, wherein the transfer functions [(0F1 1 - UF52)] between 
the respective loudspeakers [(1 1-15)] and the respective reference points [(17, 18)] 
are mean head-related transfer functions [(HRTFs)] obtained by averaging a large 
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number of individuals. 

4. A system according to [one of the preceding claims] claim 1 . wherein the transfer func- 
tion [(GF1 1 - 0F52)] between the respective loudspeaker [(11 - 15)] and the respective 
reference point [(17, 18)] is a transfer function which corresponds to the convolution of 
the head-related transfer function with a room impulse response in such a way that the 
sound reflections of the room in which the plurality of loudspeakers [(1 1-15)] and the 
two reference points [(17, 18)] are positioned are taken into account. 

5. A system according to [one of the preceding claims] claim 1 , wherein the transfer func- 
tions [(GF11 - GF52)] between the respective loudspeakers [(11 - 15)] and the respec- 
tive reference points [(17, 18)] are averaged transfer functions which are the result of 
averaging individual transfer functions between fixed loudspeaker positions and varying 
positions of the reference points [(17, 18)]. 

6. A system according to [one of the claims 1 to 4] claim 1 , wherein said conversion unit 
[(19)] is arranged for providing transfer functions for various positions of said first and 
second reference points [(17, 18)] with respect to fixed loudspeaker positions, and 
wherein the quality-evaluating unit [(20)] is arranged for providing the indication of the 
quality of the audio test signal for various transfer functions and for providing the posi- 
tions of the reference points [(17, 18)] for the indication of the poorest quality. 

7. A system according to [one of the preceding claims] claim 1 . wherein the room [(30)] is 
a standardized reference listening room and wherein the two reference points [(17, 18)] 
simulate the auditory organs of a test person at a reference listening position. 

8. A system according to [one of the claims 1 to 6] claim 1 , wherein the room [(30)] is a 
sound studio and wherein the two reference points simulate the auditory organs of a 
test person at an arbitrary seated/standing position in said room. 

9. A system according to [one of the claims 5 to 8] claim 5 . wherein the different positions 
of the first and second reference points [(17, 18)] deviate only slightly from a reference 
position so as to simulate a bearing movement of a test person. 
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10. A system according to [one of the claims 5 to 8] claim 5 , wherein the different positions 
of the first and second reference points deviate markedly from the reference position so 
as to simulate a rotation of the head of a test listener. 

1 1 . A system according to [one of the preceding claims] claim 5 , wherein the audio test 
signal comprises five channels, said five channels being a left rear, a right rear, a left 
front, a right front and a middle front channel. 

12. A system according to [one of the claims 1 to 10] claim 1 . wherein the audio test signal 
is a stereo signal. 

13. A system according to [one of the preceding claims] claim 1 . wherein the conversion 
unit [(1 9 )] comprises: 

an FIR filter for each loudspeaker/reference-point combination, the filter coefficients of 
each FIR filter being determined by the transfer function of the transmission path from 
the respective loudspeaker to the respective reference point; 

a first adder [(22)] for the first reference point [(17)] for adding the output signals of the 
FIR filters [(0F1 1 - 0F51)], which represent transmission paths to the first reference 
point [(17)], so as to provide the first audio test sum signal and the first audio reference 
sum signal, respectively; and 

a second adder [(23)] for the second reference point [(18)] for adding the output signals 
of the FIR filters [(UF12 - UF52)], which represent a transmission path to the second 
reference point [(18)], so as to provide the second audio test sum signal and the sec- 
ond audio reference sum signal, respectively. 

14. A method for evaluating the quality of an audio test signal derived from an audio refer- 
ence signal by coding and decoding, said audio test signal and said audio reference 
signal each comprising a plurality of channels, each channel being adapted to be made 
audible by one loudspeaker [(1 1 - 15)] of a plurality of loudspeakers which are posi- 
tioned at different positions in an at least fictitious room [(30)], and two reference points 
[(17, 18)] being defined with respect to the positions of the plurality of loudspeakers, 
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said method comprising the following steps: 

converting [(19)] the audio reference signal into a first audio reference sum signal at the 
first reference point [(17)] and into a second audio reference sum signal at the second 
reference point [(18)]; 

converting the audio test signal into a first audio test sum signal at the first reference 
point [(17)] and into a second audio test sum signal at the second reference point 
[(18)]; 

weighting the respective channels, which can be emitted by said plurality of loudspeak- 
ers [(11 - 15)], with a respective transfer function [(0F11 - UF52)] between the respec- 
tive loudspeaker and the reference point in question; 

superimposing the weighted channels at said first and at said second reference point 
[(17, 18)] so as to obtain the audio reference sum signals and the audio test sum sig- 
nals; and 

conducting the audio test sum signals and the audio reference sum signals to a unit 
[(20)] for evaluating the quality of the audio test sum signals while taking into consid- 
eration the audio reference sum signals so as to obtain an indication of the quality of 
the audio test signal. 

15. A method according to claim 14,wherein the following step precedes the step of con- 
verting [(19)]: 

obtaining the individual transfer functions [(UF11 - UF52)] between each loudspeaker 
[(11 - 15)] and each reference point [(17, 18)]. 

16. A method according to claim 15, wherein the step of obtaining comprises the following 
sub-steps: 

exciting a loudspeaker [(11 - 15)] with an excitation signal; 
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measuring the signal at each reference point [(1 7, 1 8)]; 

determining the transfer function between the excited loudspeaker and the first refer- 
ence point [(17)]; 

determining the transfer function between the excited loudspeaker and the second ref- 
erence point [(18)]; and 

repeating the steps of exciting, measuring and determining until all the loudspeakers 
[(1 1-15)] have been excited so as to obtain the individual transfer functions. 

17. A method according to claim 16, wherein the first and second reference points [(17, 
18)] are the ears of a human listener. 

18. A method according to claim 16, wherein the first and second reference points are built- 
in microphones of an artificial head. 

19. A method according to [one of the claims 16 to 18] claim 16 , wherein the excitation sig- 
nal is pseudo-noise signal. 

20. A method according to claim 1 5, wherein the step of obtaining comprises the following 
sub-steps: 

accessing a head-related transfer function [(HRTF)] for a determined positioning of a 
loudspeaker [(11-1 5)] relative to the first reference point [(1 7)]; 

determining [the] a room impulse response for the position of the loudspeaker in the 
room; 

convoluting the head-related transfer function [(HRTF)] with said room impulse re- 
sponse so as to obtain the transfer function from said loudspeaker to the first reference 
point [(17)]; 

repeating the steps of accessing, determining and convoluting so as to obtain the 
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transfer function [(UF11 - UF52)] from said loudspeaker to the second reference point; 
and 

executing the steps of accessing, determining, folding and repeating for each additional 
loudspeaker so as to obtain all the individual transfer functions. 

21. A method according to claim [19] 20, wherein the room impulse response is determined 
by simulating the room. 
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System and method for evaluating the quality of multi-channel audio signals 

Abstract 

A system for evaluating the quality of an audio test signal derived from an audio reference 
signal by coding and decoding, said audio test signal and said audio reference signal each 
comprising a plurality of channels, comprises a unit [(19)] for converting the audio reference 
signal into a first audio reference sum signal at a first reference point [(17)] and into a sec- 
ond audio reference sum signal at a second reference point [(18)] and for converting the 
audio test signal into a first audio test sum signal at the first reference point [(17)] and into a 
second audio test sum signal at the second reference point [(18)], the audio reference sum 
signals and the audio test sum signals at the first and second reference points [(17, 18)] 
being a superposition of the respective channels, which can be emitted by a plurality of 
loudspeakers [(11 - 15)], weighted with a respective transfer function [(UF11 - 0F52)] be- 
tween the respective loudspeaker and the reference point in question, and a unit [(20)] for 
evaluating the quality of the audio test sum signals while taking into consideration the audio 
reference sum signals so as to provide an indication of the quality of the audio test signal. 
The system according to the present invention permits real rooms and an arbitrary number 
of channels of the audio test signal to be taken into account so as to execute a listening- 
adapted evaluation of the quality of a specific coding/decoding method. 




Fig. 1 




Fig. 2 




Fig. 3 
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System and method for evaluating the quality of multi-channel audio signals 



Since listening-adapted digital coding methods have been standardized, they have been 
used to an increasing extent. Examples for such cases of use are the digital compact cas- 
sette, the minidisk, digital terrestrial radio broadcasting and the digital video disk. When 
coding is effected by means of listening-adapted coding methods, artificial products or arti- 
facts may, however, occur, which did not occur in analog audio signal processing. 

Forjudging or evaluating a specific encoder, listening test with test persons were carried 
out in the past. Although the average result provided by such listening tests is compara- 
tively reliable, there is still a subjective component. Furthermore, listening tests executed 
with a certain number of test persons are comparatively complicated and therefore com- 
paratively expensive. Hence, measurement methods have been developed for a listening- 
adapted evaluation of audio signals. 

Such a measurement method is described e.g. in DE 196 47 399 C1 . The method of listen- 
ing-adapted quality evaluation described in this publication models all non-linear hearing 
effects onto a reference signal as well as onto a test signal. The listening-adapted quality 
evaluation is carried out by means of a comparison in the cochlear domain. In so doing, the 
excitations caused in the ear by the test signal and by the reference signal are compared. 
For this purpose, both the audio reference signal and the audio test signal are divided into 
their spectral components by a filter bank. By means of a large number of filters whose fre- 
quencies overlap, a sufficient resolution with respect to time as well as frequency is guar- 
anteed. Hence, a mono audio test signal, which is derived from an audio reference signal 
by coding and subsequent decoding, can be evaluated with regard to its quality. 

The measurement method described in DE 196 47 399 D1 also permits an evaluation of the 
quality of stereo signals, i.e. two-channel signals. For this purpose, non-linear preprocess- 
ing is carried out with the left and with the right channel of the audio test signal and of the 
audio reference signal; this preprocessing emphasizes transients in a frequency-selective 
manner and reduces stationary signals. In particular, different detections of the error prob- 
ability are carried out with the left channel of the audio reference signal and with the left 
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channel of the audio test signal as input signals, with the right channel of audio reference 
signal and with the right channel of the audio test signal as input signals, with the left chan- 
nel of the preprocessed audio reference signal and with the left channel of the preproc- 
essed audio test signal as input signals and with the right channel of the preprocessed 
audio reference signal and with the right channel of the preprocessed audio test signal as 
input signals so as to obtain a measure of the quality of the stereophonic audio test signal. 

A disadvantage of the known method for listening-adapted quality evaluation of audio sig- 
nals is the fact that the stereo ability is limited to a reproduction by headphones alone. In 
other words, the audio test signal which enters the ear of a listener is compared with the 
audio reference signal which enters the ear of a listener. This means that effects produced 
by a room, such as reflections on the walls, on the ceiling and on the floor, multiple reflec- 
tions, attenuations, etc., are not taken into account. Furthermore, known quality-evaluating 
methods are not able to take into account any directional characteristic of the human ear, 
i.e. it makes no difference whether a signal comes from the rear, from the front or from the 
side. Known measurement methods are only applicable to headphone reproduction in the 
case of which the acoustic signal is emitted by the headphone loudspeaker, which is nor- 
mally arranged directly on the ear, and is introduced in the ear or the quality-evaluating pro- 
cess. 

The known method is also disadvantageous insofar as a listening-adapted quality evalua- 
tion of multi-channel signals, such as e.g. 5-channel signals, which become more and more 
common and which are known under the headword "Dolby surround", has been absolutely 
impossible up to now. 

It is the object of the present invention to provide an improved concept for evaluating the 
quality of audio signals in the case of which room effects are additionally taken into account. 

This object is achieved by a quality-evaluating system according to claim 1 and by a quality- 
evaluating method according to claim 14. 

The present invention is based on the finding that, although signals comprising an arbitrary 
number of channels exist, the human listener, who counts in the final analysis, always has 
only two ears at his disposal. Directional hearing is caused by the different impulse re- 
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sponses for different incidence directions of sound signals into the human ear. The different 
impulse responses for different incidence directions are referred to as "head-related transfer 
functions" in the field of technology. In reality, there are not only the direct sound paths be- 
tween the ear and the loudspeaker, but reflections on the walls, on the ceiling and on the 
floor occur as well. This can be summarized as room impulse response. The HRTFs and 
the room impulse response lead, in combination, to a change of sound which can, accord- 
ing to the present invention, also be evaluated by measurement systems without explicit 
modelling of binaural effects, such as different masking thresholds for binaural signals in 
comparison with monaural signals, perception of phase shifts, precedence effects, etc.. 

When audio signals are evaluated by means of listening tests, standardized listening 
rooms, which have been standardized e.g. according to ITU-R BS.1 116, are normally used. 
The size, the loudspeaker arrangement and the reverberation time are largely determined in 
this case. When a more comprehensive quality evaluation of audio signals is carried out in 
accordance with the present invention, both the head-related transfer functions (HRTFs) as 
well as the room impulse responses can be taken into account. For the listening-adapted 
quality evaluation according to the present invention it is, furthermore, of no importance 
whether a signal is a stereo signal which is emitted by two loudspeakers for the left and for 
the right channel, or whether the signal is a multi-channel signal comprising e.g. five chan- 
nels and emitted by five loudspeakers which are positioned with respect to a listener e.g. in 
such a way that the loudspeakers are arranged at the rear left, front left, rear right, front 
right and at the front. 

The quality-evaluating system according to the present invention comprises for this purpose 
a unit for converting the audio reference signal into a first audio reference sum signal at a 
first reference point and into a second audio reference sum signal at a second reference 
point and a unit for converting the audio test signal into a first audio test sum signal at the 
first reference point and into a second audio test sum signal at the second reference point, 
the audio reference sum signals and the audio test sum signals at the first and second ref- 
erence points being a superposition of the respective channels, which can be emitted by the 
plurality of loudspeakers, weighted with a respective transfer function between the respec- 
tive loudspeaker and the reference point in question. The audio reference sum signals and 
the audio test sum signals are finally fed into a quality-evaluating unit so as to obtain an 
indication for the quality of the audio test signal. The quality-evaluating unit can be an arbi- 
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trary known unit of the type disclosed e.g. in DE 196 47 399 C1 or of the type specified in 
the international standard ITU-R BS 1387 (PEAQ). 

The method according to the present invention is advantageous with regard to the fact that, 
when the audio signal is a stereo signal, the influences of the listening room on the signal 
propagation from each loudspeaker to each reference point, i.e. each ear, can be taken into 
account. 

Another advantage is to be seen in the fact that the method is applicable to audio signals 
comprising an arbitrary number of channels, since the channels are converted into two sum 
signals via respective transfer functions modelling the propagation of a signal from one 
loudspeaker to one ear, in such a way that an arbitrary quality-evaluating method, which is 
suitable for two channels, can be used. 

Normally, the individual transfer functions can be gained by measurement making use of 
built-in microphones with an artificial head or of probe microphones with a human listener. 
The method according to the present invention will, however, be particularly advantageous 
when the head-related transfer functions of arbitrary persons are already known and can 
e.g. be downloaded via the Internet from a suitable server. In this case, the room impulse 
response which occurs in a listening room and which can be measured or simulated can be 
convoluted with a specific existing HRTF so as to obtain a transfer function. This will be ad- 
vantageous especially in cases where the listening room does not yet exist, i.e. where the 
acoustic properties of a room are simulated prior to actually constructing the room so as to 
simulate the acoustic properties when e.g. concert halls or sound studios are planned and 
so as to optimize the listening room already prior to its construction. 

In the following, preferred embodiments of the present invention will be explained in detail 
making reference to the drawings enclosed, in which: 

Fig. 1 shows a schematic block diagram of a system according to the present 

invention; 

Fig. 2 shows a schematic diagram for determining the head-related transfer 

~~~~~ functions (HRTFs); and 



Fig. 3 shows a schematic block diagram for representing the situation in a real 

listening room. 

Fig. 1 shows a schematic block diagram of a system for evaluating the quality of an audio 
test signal derived from an audio reference signal by coding and decoding. The audio test 
signal and the audio reference signal each comprise a plurality of channels; each channel 
can be made audible by one loudspeaker of a plurality of loudspeakers 1 1 to 15 which are 
positioned at different positions in an at least fictitious room, two reference points 17, 18 for 
simulating the hearing being defined with respect to the positions of the plurality of loud- 
speakers 1 1 to 15. The quality-evaluating system includes a unit 19 for converting the audio 
reference signal into a first audio reference sum signal at the first reference point 17 and 
into a second audio reference sum signal at the second reference point 18 and for convert- 
ing the audio test signal into a first audio test sum signal at the first reference point 17 and 
into a second audio test sum signal at the second reference point 18, the audio reference 
sum signals and the audio test sum signals at the first and second reference points 17, 18 
being a superposition of the respective channels which can be emitted by said plurality of 
loudspeakers 11 to 15, weighted with a respective transfer function UF11 to UF52 between 
the respective loudspeaker 1 1 to 15 and the reference point 17, 18 in question. The quality- 
evaluating system additionally includes a unit 20 for evaluating the quality of the audio test 
sum signals while taking into consideration the audio reference sum signals so as to pro- 
vide an indication of the quality of the audio test signal at an output 21 . 

In the following, the conversion unit 19 will be explained. This unit comprises a plurality of 
transfer functions UF11 to 0F52, which are either the HRTFs, when an anechoic room, i.e. 
a room in which no reflections occur, is considered, or which are the whole transfer func- 
tion of the room from one of the loudspeakers 1 to 5 to a reference point 1 , 2. As can be 
seen in Fig. 1, the output signals of the loudspeakers are weighted with the respective 
transfer functions. The output signals produced when the input signals are weighted with 
the respective transfer functions are added by means of a first adder 22 so as to obtain first 
audio sum signals. Analogously, a second adder 23 is provided for the second reference 
point 18 so as to add the output signals of the transfer functions from the respective loud- 
speakers 1 1 to 15 to the second reference point 18 so as to provide the second audio sum 
signals. It goes without saying that the audio test signal as well as the audio reference sig- 
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nal are processed by means of the conversion unit 19 in such a way that the same condi- 
tions prevail for both the audio reference signal and the audio test signal in such a way that 
the unit 20 for evaluating the quality of 2-channel signals will only measure the quality of 
coding/decoding and that no other effects will disturb the measurement result. 

Although Fig. 1 shows the situation for a 5-channel audio signal, the system according to 
the present invention is also applicable to stereo signals comprising only two channels or to 
signals comprising three, four or more than five channels. In this case, it will only be neces- 
sary to add or to omit respective transfer functions. Furthermore, it should be pointed out 
that the positioning of the loudspeakers in Fig. 1 is only schematic. A correct positioning of 
the loudspeakers with respect to the reference points is shown in Fig. 2 and 3 for a 5- 
channel signal example. 

With respect to the notation of the individual transfer functions reference should be made to 
the fact that the first figure always refers to the loudspeaker, whereas the second figure re- 
fers to the reference point, i.e. reference point No. 1 (17) or reference point No. 2 (18). 

Fig. 2 shows a possible arrangement of the five loudspeakers 1 1 to 15 with respect to a 
listener 24 whose head is schematically shown in Fig. 2 in a top view. Alternatively, the 
head 24 may be an artificial head. In any case, the head 24 comprises the first reference 
point 17 and the second reference point 18, i.e. the ears 17, 18 in the case of a human lis- 
tener or the built-in microphones 17, 18 in the case of an artificial head 24. In Fig. 2, trans- 
mission paths in the anechpic room from each of the loudspeakers 1 1 to 15 to each refer- 
ence point 17,18 are depicted. The head-related transfer functions (HRTFs) are deter- 
mined by screening e.g. the head or the shoulders of the person listening and by different 
transmission times. Arrow 31a, for example, represents the transmission path from the first 
loudspeaker 1 1 to the first reference point 17. Arrow 31 b, which is depicted in the form of a 
broken line in the area of the head 24, represents the HRTF from the first loudspeaker 1 1 to 
the second reference point 18. Analogously, arrow 32a represents the transfer function from 
the second loudspeaker 12 to the first reference point, i.e. UF21 in Fig. 1. Arrow 32b repre- 
sents in a corresponding manner the transfer function from the second loudspeaker 12 to 
the second reference point 18, i.e. 0F22 in Fig. 1. By adding the sub-signals of the loud- 
speaker output signals, which have been weighted with the respective transfer function, at 
the reference points 17, 18, the first and second audio test sum signals and audio reference 
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sum signals are then obtained; these audio test sum signals and audio reference sum sig- 
nals can then be fed into an arbitrary quality-evaluating unit 22 for 2-channel signals so as 
to obtain a measure of the quality of the audio test signal, which is a 5-channel signal in the 
case shown in Fig. 2. 

As has already been mentioned, the scenario in Fig. 2 shows how the head-related transfer 
functions are gained in the anechoic room. This means that, when the HRTFs are gained by 
measurement, the room must be of such a nature that no sound reflectors exist within the 
room, i.e. that the whole room must be provided with a sound absorbing lining. 

Fig. 3 shows a schematic representation of transmission paths in a listening room 30 in 
which the loudspeakers 11, 12, 13, 14, 15 are arranged in the same way as in Fig. 2. In 
addition to the direct sound, an indirect path from each loudspeaker to the left ear 17 is 
shown here. Reference should be made to the fact that the scenario in Fig. 3 does not fully 
reflect the reality, since reflections occur here on all the walls, the floor and the ceiling and 
since multiple reflections exist as well. In detail, the first loudspeaker 11 additionally emits 
sound which, as shown by a line 31c, is reflected on the front wall of the room 30, propa- 
gates from the front wall and arrives at the first reference point 17. It follows that the transfer 
function from the first loudspeaker 1 1 to the left ear 17, i.e. UF1 1 in Fig. 1 , models not only 
direct sound propagation 31a from the loudspeaker to the ear but also sound propagation 
by means of a reflection 31c from the first loudspeaker 1 1 to the first ear 17. Analogously, 
there is also an indirect path, which is indicated by an arrow 32c, from the second loud- 
speaker 12 to the first ear 17. This means that the transfer function UF21 in Fig. 1 from the 
second loudspeaker 12 to the first reference point 17 models not only direct sound propa- 
gation 32a but also sound propagation by means of reflection to the first ear 17. 

In the following, the determination of the individual transfer functions UF11 to Uf52 (Fig. 1) 
will be explained. There are various possibilities of determining these transfer functions. 

The first possibility is to position the loudspeakers 11 to 15 relative to the reference points 
17 and 18 in the manner shown in Fig. 3. Subsequently, the first loudspeaker 1 1 is excited 
by means of an excitation signal, whereupon the sound signal arriving at the first reference 
point 17 is measured at this reference point; considering Fig. 3, this sound signal is a su- 
perposition of the signals 31a, 31c. In addition, the sound signal at the second reference 



8 



point 18 is measured; this sound signal could be a superposition of signal 31b and of a sig- 
nal which is not shown in Fig. 3 and which is emitted by the first loudspeaker 1 1 and re- 
flected on some wall or other in such a way that it arrives at the second reference point 18. 

The transfer function from the first loudspeaker to the first reference point 17 (UF1 1 in Fig. 
1) can be calculated from the excitation signal and from the sound signal measured at the 
first reference point 17. if the loudspeaker 1 1 is excited by means of an ideal pulse, the re- 
spective impulse response, which describes the transmission of the sound signal in the time 
domain, will be obtained directly at the reference points. In view of practical restrictions, this 
is, however, only a theoretical method, whereas, in practice, the loudspeaker 1 1 is excited 
by a pseudo-noise signal. This method is repeated for the additional loudspeakers 12 to 15 
in such a way that all the additional transfer functions UF21 to UF52 can be determined 
from the measured sound signal at the respective reference point and from the excitation 
signal at the respective loudspeaker. 

If, as has been stated, such measurements take place in a real space with non-absorbing 
walls, etc., the whole transfer function, which comprises the room impulse response and the 
head-related transfer functions (HRTFs) for the individual loudspeaker positions, will be 
determined directly. If such measurements are carried out in an anechoic room, i.e. in a 
fully sound-absorbing room, the HRTFs can be determined directly in this way; these 
HRTFs are then the transfer functions UF11 to UF52. 

Irrespectively of whether the measurement is carried out by means of two built-in micro- 
phones and an artificial head or by means of two probe microphones and a test person, 
such sound measurements are complicated and expensive not least in view of the very ex- 
pensive probe microphones. 

If, however, head-related transfer functions (HRTFs) are known for specific persons or also 
for an "average person", these head-related transfer functions can be used for being con- 
voluted with the impulse response of a room; this impulse response can also be simulated. 
In this case, no measurements will be necessary for determining the transfer functions 
UF1 1 to UF52. A substantial advantage of this method is that it can also be used for simu- 
lating rooms which have not yet been constructed so as to design e.g. a sound studio for an 
optimum sound propagation for specific loudspeaker configurations prior to actually con- 
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structing this sound studio. It follows that, in this case, it can no longer be said that the room 
in which the quality of a coded and subsequently decoded audio test signal is to be evalu- 
ated actually exists. Instead, the room only exists in a simulated form and is therefore a fic- 
titious room. 

Irrespectively of whether the room actually exists or whether it only exists as a fictitious 
room on the basis of a simulation, it is normally assumed that test persons are seated or 
stand in such a listening room, which may e.g. be a standardized listening room, at the best 
possible listening position. However, many test persons move their head to the front, to the 
rear, to the left or to the right while the test is taking place; this is also referred to as transla- 
tion. In addition, the persons will normally move slightly away from the optimum listening 
position, i.e. they move their heads to the left and to the right, this being also referred to as 
bearing movements or rotation. Hence, a possibly existing middle loudspeaker, i.e. the 
loudspeaker 13, will no longer be located precisely in the middle. This happens because the 
directional perception is often unsure precisely at the front. In particular, the front and the 
back are confused in many cases. This is also referred to as "front-back confusion" in the 
field of technology. Making reference to Fig. 2 and 3, it can be seen that the first reference 
point 17 and the second reference point 18 change with respect to the fixed positions of the 
loudspeakers in the case of each movement of the head. 

In order to cope with this situation, the quality-evaluating method carried out by the quality- 
evaluating system shown in Fig. 1 is executed for a plurality of positions of the reference 
points 17, 18, whereupon various quality indications for the different positions will be ob- 
tained. It goes without saying that for each of the different positions of the reference points 
17, 18 different transfer functions must be ascertained and used when the method accord- 
ing to the present invention is being executed. The output obtained will then be a plurality of 
quality indications for different positions of the reference points 1 7, 1 8, i.e. for different head 
positions. 

Various possibilities exist for evaluating the different quality indications. On the one hand, 
an average value can be assumed so as to be able to make a general statement to the ef- 
fect that a certain coding/decoding method may perhaps be optimal, if the position of the 
head is not changed at all, or that this method is less advantageous than some other coding 
method in the case of certain translations or bearing movements or rotations of the head. 
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On the other hand, the "worst case" of the individual measurements can be found out so as 
to be able to make a statement whether a certain coding/decoding method is sub-optimal in 
the case of a specific position of the head with respect to the five loudspeakers when 5- 
channel audio signals are processed. It will be advantageous to carry out such quality 
evaluations for a plurality of positions of the reference points 17, 18 close to the optimum 
reference listening position on the one hand. On the other hand, such measurements can 
also be carried out for other sites which are not located at the reference listening position so 
that e.g. certain other seats in a sound studio can be judged so as to find out whether or not 
coding/decoding errors can be heard there. 

The above description shows clearly that the system according to the present invention and 
the method according to the present invention provide existing quality-evaluating systems 
and methods with a substantial amount of flexibility in such a way that it is not only possible 
to evaluate the quality of audio signals with more than two channels but that it is also possi- 
ble to act out quality evaluations for different scenarios of positioning the reference points 
17, 18 relative to the loudspeakers 1 1 to 15, and that the system according to the present 
invention and the method according to the present invention can even be used for design- 
ing sound studios or other listening rooms, such a cinemas, so as to be able to carry out a 
listening-adapted evaluation of the quality of specific coding/decoding methods in a specific 
room. Furthermore, the method according to the present invention and the system accord- 
ing to the present invention can be used for designing listening rooms so that the optimum 
coding method among a large number of possible coding methods can be selected for a 
specific room. 

The transfer functions UF1 1 - UF52 can be realized in the field of circuit technology in dif- 
ferent ways. Preferably, they are realized through an FIR filter for each impulse response. 
Reference should be made to the fact that for large rooms the FIR filters may have a con- 
siderable length; in the case of a sampling frequency of 48 kHz their length may e.g. ex- 
ceed 100,000 sampling values. In this case, it will be advisable to represent the first milli- 
seconds of this length, where the reflections occurring are primarily discrete reflections, 
more precisely than the time domain towards the end of the filter, where the reflections oc- 
curring are primarily diffuse reflections. 
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Claims 

1 . A system for evaluating the quality of an audio test signal derived from an audio refer- 
ence signal by coding and decoding, said audio test signal and said audio reference 
signal each comprising a plurality of channels, each channel being adapted to be made 
audible by one loudspeaker (1 1 - 15) of a plurality of loudspeakers which are posi- 
tioned at different positions in an at least fictitious room (30), and two listening refer- 
ence points (17, 18) being defined with respect to the positions of the plurality of loud- 
speakers, said system comprising: 

a unit (19) for converting the audio reference signal into a first audio reference sum 
signal at the first reference point (17) and into a second audio reference sum signal at 
the second reference point (18) and for converting the audio test signal into a first audio 
test sum signal at the first reference point (17) and into a second audio test sum signal 
at the second reference point (18), the audio reference sum signals and the audio test 
sum signals at the first and second reference points (17, 18) being a superposition of 
the respective channels, which can be emitted by said plurality of loudspeakers (11 - 
15), weighted with a respective transfer function (0F11 - UF52) between the respective 
loudspeaker and the reference point in question; and 

a unit (20) for evaluating the quality of the audio test sum signals while taking into con- 
sideration the audio reference sum signals so as to provide an indication of the quality 
of the audio test signal. 

2. A system according to claim 1 , wherein the transfer functions (UF1 1 - UF52) between 
the respective loudspeakers (11-15) and the respective reference points (17, 18) are 
individual head-related transfer functions (HRTFs) so as to take into account the differ- 
ent impulse responses for different sound incidence directions into the human ear (17, 
18). 

3. A system according to claim 2, wherein the transfer functions (UF1 1 - 0F52) between 
the respective loudspeakers (11-15) and the respective reference points (17, 18) are 
mean head-related transfer functions (HRTFs) obtained by averaging a large number of 



12 



individuals. 

4. A system according to one of the preceding claims, wherein the transfer function (0F1 1 
- 0F52) between the respective loudspeaker (11-15) and the respective reference 
point (17, 18) is a transfer function which corresponds to the convolution of the head- 
related transfer function with a room impulse response in such a way that the sound re- 
flections of the room in which the plurality of loudspeakers (11 - 15) and the two refer- 
ence points (17, 18) are positioned are taken into account 

5. A system according to one of the preceding claims, wherein the transfer functions 
(0F1 1 - UF52) between the respective loudspeakers (11-15) and the respective ref- 
erence points (17, 18) are averaged transfer functions which are the result of averaging 
individual transfer functions between fixed loudspeaker positions and varying positions 
of the reference points (17, 18). 

6. A system according to one of the claims 1 to 4, wherein said conversion unit (19) is 
arranged for providing transfer functions for various positions of said first and second 
reference points (17, 18) with respect to fixed loudspeaker positions, and wherein the 
quality-evaluating unit (20) is arranged for providing the indication of the quality of the 
audio test signal for various transfer functions and for providing the positions of the ref- 
erence points (17, 18) for the indication of the poorest quality. 

7. A system according to one of the preceding claims, wherein the room (30) is a stan- 
dardized reference listening room and wherein the two reference points (17, 18) simu- 
late the auditory organs of a test person at a reference listening position. 

8. A system according to one of the claims 1 to 6, wherein the room (30) is a sound studio 
and wherein the two reference points simulate the auditory organs of a test person at 
an arbitrary seated/standing position in said room. 

9. A system according to one of the claims 5 to 8, wherein the different positions of the 
first and second reference points (17, 18) deviate only slightly from a reference position 
so as to simulate a bearing movement of a test person. 
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10. A system according to one of the claims 5 to 8, wherein the different positions of the 
first and second reference points deviate markedly from the reference position so as to 
simulate a rotation of the head of a test listener. 

1 1 . A system according to one of the preceding claims, wherein the audio test signal com- 
prises five channels, said five channels being a left rear, a right rear, a left front, a right 
front and a middle front channel. 

12. A system according to one of the claims 1 to 10, wherein the audio test signal is a ste- 
reo signal. 

13. A system according to one of the preceding claims, wherein the conversion unit (19 ) 
comprises: 

an FIR filter for each loudspeaker/reference-point combination, the filter coefficients of 
each FIR filter being determined by the transfer function of the transmission path from 
the respective loudspeaker to the respective reference point; 

a first adder (22) for the first reference point (1 7) for adding the output signals of the 
FIR filters (GF1 1 - 0F51), which represent transmission paths to the first reference 
point (17), so as to provide the first audio test sum signal and the first audio reference 
sum signal, respectively; and 

a second adder (23) for the second reference point (18) for adding the output signals of 
the FIR filters (GF12 - UF52), which represent a transmission path to the second refer- 
ence point (18), so as to provide the second audio test sum signal and the second 
audio reference sum signal, respectively. 

14. A method for evaluating the quality of an audio test signal derived from an audio refer- 
ence signal by coding and decoding, said audio test signal and said audio reference 
signal each comprising a plurality of channels, each channel being adapted to be made 
audible by one loudspeaker (1 1 - 15) of a plurality of loudspeakers which are posi- 
tioned at different positions in an at least fictitious room (30), and two reference points 
(17, 18) being defined with respect to the positions of the plurality of loudspeakers, said 
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method comprising the following steps: 

converting (19) the audio reference signal into a first audio reference sum signal at the 
first reference point (17) and into a second audio reference sum signal at the second 
reference point (18); 

converting the audio test signal into a first audio test sum signal at the first reference 
point (17) and into a second audio test sum signal at the second reference point (18); 

weighting the respective channels, which can be emitted by said plurality of loudspeak- 
ers (11 - 15), with a respective transfer function (UF11 - UF52) between the respective 
loudspeaker and the reference point in question; 

superimposing the weighted channels at said first and at said second reference point 
(17, 18) so as to obtain the audio reference sum signals and the audio test sum sig- 
nals; and 

conducting the audio test sum signals and the audio reference sum signals to a unit 
(20) for evaluating the quality of the audio test sum signals while taking into considera- 
tion the audio reference sum signals so as to obtain an indication of the quality of the 
audio test signal. 

15. A method according to claim 14,wherein the following step precedes the step of con- 
verting (19): ^ 

obtaining the individual transfer functions (UF1 1 - UF52) between each loudspeaker 
(11 - 15) and each reference point (17, 18). 

16. A method according to claim 15, wherein the step of obtaining comprises the following 
sub-steps: 

exciting a loudspeaker (11 - 15) with an excitation signal; 
measuring the signal at each reference point (17, 18); 
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determining the transfer function between the excited loudspeaker and the first refer- 
ence point (17); 

determining the transfer function between the excited loudspeaker and the second ref- 
erence point (18); and 

repeating the steps of exciting, measuring and determining until all the loudspeakers 
(11-15) have been excited so as to obtain the individual transfer functions. 

17. A method according to claim 16, wherein the first and second reference points (17, 18) 
are the ears of a human listener. 

18. A method according to claim 16, wherein the first and second reference points are built- 
in microphones of an artificial head. 

19. A method according to one of the claims 16 to 18, wherein the excitation signal is 
pseudo-noise signal. 

20. A method according to claim 15, wherein the step of obtaining comprises the following 
sub-steps: 

accessing a head-related transfer function (HRTF) for a determined positioning of a 
loudspeaker (11-15) relative to the first reference point (17); 

determining the room impulse response for the position of the loudspeaker in the room; 

convoluting the head-related transfer function (HRTF) with said room impulse response 
so as to obtain the transfer function from said loudspeaker to the first reference point 
(17); 

repeating the steps of accessing, determining and convoluting so as to obtain the 
transfer function (0F1 1 - 0F52) from said loudspeaker to the second reference point; 
and 
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executing the steps of accessing, determining, folding and repeating for each additional 
loudspeaker so as to obtain all the individual transfer functions. 

A method according to claim 19, wherein the room impulse response is determined by 
simulating the room. 
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S/cond Inventor's _siao£ture 


Date 




Wohnsitz 


Residence 

Nuernberg , Germany 






Staatsangehongkett 


Citizenship 

German 


Postanscnrift 


Post Office Address 

Alf ons— Stauder— Strasse 77 




D-90453 Nuernberg, Germany 



(Biite entsprechende tnformationen und UnterscMften im (Supply similar information and signature for third and sub 

FaUe von dritten und wekeren MHerfmdem angeben). sequent joint inventors,) 
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.A 




German Language Declaration 


VERTReTUNGSVGLLMACHT: Als benaantec EWinder 
beauftrage ich hiermrt den nachsiehend benanntcn Palen- 
(anwait (oder die nacftsiehend benannten Paientanwaite) und/ 
Oder Patent-Agenlen mit der Vertotgung der vortiegenden 
j Paientanmeidung sowie mit der Abwicklung aller damit ver- 
I buodenen Goschafte vor dem Palenl-und Warenzeichenamti 
1 (Name und Regtstraaonsnummer anfuhren) 


POWER OF ATTORNEY: As a named inventor 1 her&hx, 
appoint the following attorney(s) anchor agent(s) to prosecute 
this application and transact ait business in the Patent and 
Trademark Office connected therewith, (fist name and reg- 
istration number) 

Michael A. GLENN, Reg. No. 30,176 
Donald M. HENDRICKS, Reg. No. 40,355 
Kirk D. WONG, Reg. No. 43,284 




Christopher PEIL, Reg. No. 45,005 


\ Tel^Cpngesprache bitte nchten an: 
l (tiaf^e und Teiefonnummer) 


OirecJ Telephone Cails to; {name and telephone number) 


\ Po$j§nschritt: 


Send Correspondence lo: 
GLENN PATENT GROUP 

3475 Edison Way, Suite L, 



Menlo Park, CA 94025 
U.S.A. 



Voj^r Name des einztgen oder urspruogticften Erftnders: 


Full name of sole or first inventor 
SPORER, Thomas 


Uftterschrift des Extiadecs Da(um 


Inventor's signature Oate 




Residence 

Fuerth, Germany 


Staatsaogehongkeit 


Gttzenstiip 

German 


Postaoschrifl 


Post Office Address 

Wilhelmshavenerstrasse 15 




D-90766 Fuerth, Germany 


Votter Name des zweiien Mitorfirwers (latts zutreffeod) t> r . 


Full name of second joint tnvenfor, tf any 
__BITT0, Roland 


Unterschrift des Erftnders Datum 


Second Inventor's stao^ture Date 

,?T£J frit, Au * ust 13 « 2001 


Wohnsrtz 


Residence r 
Nuernberg , Germany y£ 


Staatsangehoogkett 


'duzensivp — ""^ 
German 


Postanschrift 


Post Office Address 

Alfons-Stauder-Strasse 77 




D-90453 Nuernberg, Germany 



(Bitte entsprechende Informatioaen and Untersctuiften irn (Suppty simitar information and signature tor third and sub- 

Fatte von datten und weHeren Mitedindem angebea). sequent joint inventors.) 
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German Language Declaration 


V6RTRETUNGSVOLLMACHT: Als benannter Eriinder 
beauftrage ich h'termit den nachstehend benannten Paten- 
tanwatt (oder die nac^stehend benannten Pa(entanwalte) und/ 
Oder Patent-Agenten mit der Verfolgung der vorfiegenden 
PatentanroeJdung sowio mil der AbwickJung alter damit ver- 
bundenen Geschafte vor dem Patent-und Warenzeichenamt: 
(Name und Regtstrs::onsnummer anfuhren) 


POWER OF ATTORNEY: As a named Inventor, 1 hereby 
appoint the following attomey(s) and/or agent(s) to prosecute 
this application and transact all business in the Patent and 
Trademark Office connected therewith. (fts( name and reg- 
istration number) 

Michael A. GLENN, Reg, No. 30,176 
Donald M. HENDRICKS, Reg, No, 40,355 
K i r k D - WONG . Rea . No . 43,284 




Christopher PEIL, Reg. No, 45,005 


Tgibfongespracne bitte richten an: 
Ofkfcne und Tefefonnummer) 


Direct Telephone Calls to: {name and telephone number) 


P^itanschrift: - 


Send Correspondence to: 
GLENN PATENT GROUP 

3475 Edison Way, Suite L, 




Menlo Park, CA 94025 



U.S.A. 



filler Name des (kitten Miterfinders, falls zutreffend 


Full nans of third joint inventor, if any 

BRANDENBURG . ^rlhpnn. „ 


0hterschrift des Erfinders Qatum^H/ v 




Wohnsitz 


Residence — ^ f 
JBrlangen , Germany I>t> ^ 


Slaatsangehorigkeit 


Citizenship 

German 


Postanschrift 


Post Office Address 

Haagstrasse 32 




D-91054 Erlangen, Germany 


Voller Name des vierten Miterfiiiders, fells zutreffend 


Full name of fourth joint inventor, if any 


Unterscnrift des £ritndars Oatum 


Fourth inventor's signature 0alQ 


Wohnsitz 


Residence 


Staatsangehorigkett 


Citizenship 


Postanschrift 


Post Office Address 







(Bitte entsprechende taformationen und Unterschdften im (Supply similar information and signature for ttmi and sub- 

Fatie von dritten und weiteren MHerftndem angeben). sequent joint inventors.) 
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